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Description 

The present invention relates to ONA molecules responsible tor encoding the multi-functional proteins that direct 
the biosynthesis of polyketide compounds, the products encoded by said ONA molecules, recombinant ONA expression 
vectors, and transformed microbial host cells. 

Polyketides are a family of compounds that include a large number of structurally and functionally diverse natural 
products. For example, the polyketides provide the structural backbone for compounds that exhibit a variety of biological 
activities, such as, antibiotic, antitumor, and immunosuppressive agents. 

Although the polyketides are quite diverse as indicated, they share a common mechanistic scheme of biosynthesis 
"me polyketides are synthesized by the successive condensation of small carboxylic acid residues followed by variable 
reduction steps at the resulting p-keto carbon (i.e., p^arbonyl) moiety in a process that is similar to the synthesis of 
fatty acids. The iterative synthetic process for many of these polyketides is controlled by a complex of large multi- 
functional polypeptides that have distinct sites for the variety of activities that are required. 

The general scheme for polyketide biosynthesis has been reviewed, for example, in Hopwood and Sherman Annu 
'5 Rev. Genet. . 1 990. 24:37 and Katz and Donadio. Annu. Rev. Microbiol.. 1 993. 47:875. 

Naturally-occurring DNA sequences that encode the polyketide synthase'enzymes have generally been found to 
be organized mto repeated subunits, or modules, each of which encodes all the activities required in a single round of 
synthesis, which includes the condensation step itself and the post-condensation processing steps Each activity is 
associated with a distinct site, which contributes to the specificity for the particular camoxylic acid building block that 
is incorporated at each condensation step, or which dictates the particular post<ondensation processing functions 
that will be executed. a 

For example, PCT publication WO 93/1 3663 describes the organization of the gene encoding the polyketide syn- 
thase of Saccharopolyspora erythraea. The gene is organized in modules, with each module effecting one condensation 
step. The precise sequence of chain growth and the processing of the growing chain is determined by the genetic 
information in each module. This PCT application describes an approach for synthesizing novel polyketide structures 
by manipulating in several ways the DNA governing the biosynthesis of the cyclic lactone framework. In order to adapt 
this methodology to other polyketides. however, the DNA molecules directing the biosynthetic processing must first be 
isolated. 

The present invention is directed to the DNA sequence for the gene cluster responsible for encoding tylactone 
synthase, the building machinery of tylactone, which is the basic building block of tylosis As a result the present 
invention enables modification of the DNA sequence so as to alter the type of carboxylic acids incorporated, the number 
of carboxylic acids incorporated, the post^ondensation reactions performed, or any combination thereof, thereby re- 
sulting in novel tylosin-related polyketides. 

The present invention provides a DNA molecule comprising an isolated DNA sequence that encodes a tylactone 
synthase domain. Thus, the present invention provides the DNA molecule of SEQ ID NO:1 and DNA molecules that 
contain submodules thereof. The present invention also provides the products encoded by said DNA molecules re- 
combinant DNA expression vectors, and transformed microbial host cells. 

Figure 1 is a map of the tylactone polyketide synthase region (tylG) of the Streptomyces fradiae DNA (-45 kb) 
Distances in kb are relative to the beginning of tylG. Open reading frames (ORFs) are indicated by arrows. Restriction 
40 sites are denoted as follows: 

E - EcoRI 
B - BamHI 

45 Predicted functional domains are labeled as follows: 

ACP - acyl carrier protein 

AT - acy transferase 

DH - dehydratase 

50 ER - enoylreductase 

KR - ketoreductase 

KR* - ketoreductase-like domain predicted to be inactive 

KS - ketosynthase 

KS' - ketosynthase-like domain in which a glutamine residue is present in the position occupied by an active site 

cysteine in a normal ketosynthase 

TE - thioesterase. 

Figure 2 depicts the biosynthetic pathway for tylactone synthesis. 
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Figure 3 is a map of the two clones that span the whole region of the tylG DNA. 

Figure 4 is a map of the srmG region of the Streptomyces ambofadens DNA. Distances in kb are relative to the 
beginning of srmG, Open reading frames are indicated by arrows. The srmG DNA (0-42 kb) is the platenolide polyketide 
synthase region. Restriction sites are denoted as above with additions as follows: 

5 

AP- Apa\ 
G- Bgh\ 
K - Kpri 
P- Psfl 
10 X - Xhoi 

Predicted functional domains are labeled as above. 

Figure 5 demonstrates the biosynthetic pathway for platenolide synthesis. 
Figure 6 is a map of the two clones that span the whole region of the srmG DNA. 
is Polyketides are a large class of structurally and functionally diverse natural products. A common feature of com- 

pounds in this class is that their synthesis is accomplished under the direction of a complex of multi-functional peptides, 
termed a "polyketide synthase." Molecular genetic analysis of polyketide synthase genes has revealed two distinct 
classes of enzymes operating for different polyketides: 1 ) the aromatics, which are made through an essentially iterative 
process, and 2) the complex polyketides, which comprise several repeats of the same activities arranged in few very 
20 large polypeptides. 

Among the complex polyketide synthase genes, a polyketide synthase includes enzymatic and regulatory activities 
responsible for exercising substrate specificity, catalyzing the condensation of small carboxylic acid building blocks (in 
the form of coenzyme A thioesters) to a growing polyketide carbon chain, and catalyzing the post-condensation 
processing reactions at the p-carbonyl functional group that results from the condensation reaction. 

25 The condensation reaction requires several activities including acyl carrier protein (ACP), p-ketosynthase (KS), 

and acyltransferase (AT). Once a condensation has occurred, the resulting p-carbonyl functional group may be mod- 
ified. Post-condensation activities that may be involved include p-ketoreductase (KR) t dehydratase {OH), and enoylre- 
ductase (ER). Polyketide biosynthesis is terminated by a thioesterase (TE) activity. Whether all, some, or none of these 
activities act after a particular condensation step determines the structure of the final product. 

30 The present invention provides, in particular, the DNA sequence encoding the polyketide synthase responsible for 

biosynthesis of tylactone, i.e., tylactone synthase. Tylactone itself is the polyketide backbone of the commercially sig- 
nificant antibiotic tylosin. The tylactone synthase DNA sequence, which defines the tylactone synthase gene cluster, 
directs biosynthesis of the tylactone polyketide by encoding the various distinct activities of tylactone synthase. 

The gene cluster for tylactone synthase, like other complex polyketide biosynthetic genes whose organization has 

35 been elucidated, is characterized by the presence of several open reading frames (ORFs), each of which contains one 
or more repeated units termed "modules." A module is defined as the genetic element encoding all of the distinct 
activities required in a single round of synthesis, i.e., one condensation step and all the p-carbonyl processing steps 
associated therewith. Each module encodes an ACP, a KS, and an AT activity to accomplish the condensation portion 
of the synthesis, and selected post-condensation activities to effect p-carbonyl processing. The polypeptides encoded 

40 by such modules are termed "synthase units" (SUs). 

Each module is further characterized by the inclusion of submodules that are responsible for encoding the distinct 
activities of tylactone synthase. For purposes of the present invention, the term "sub-module" is defined as the portion 
of the polyketide synthase DNA sequence that encodes a distinct activity, or "domain". Thus, a domain is taken as 
commonly understood to mean that part of the polyketide synthase polypeptide necessary for a given distinct activity. 

45 Organization of the tylactone synthase gene cluster derived from Streptomyces fradiae, is shown in Figure 1 . The 

tylactone synthetic pathway, with indications of the specific carboxylic acid substrates used for each condensation 
reaction and the various post-condensation activities, is shown in Figure 2. 

A preferred DNA molecule comprising the tylactone synthase gene cluster isolated from Streptomyces fradiae is 
represented by SEQ ID NO: 1 . Other preferred DNA molecules of the present invention include the various open reading 

50 frames of SEQ ID NO:1 that encode individual multi-functional polypeptides. These are represented by the following 
nucleotide residues of SEQ ID NO:1 : ORF1 816 to 14234, ORF2 14351 to 19945, ORF3 20010 to 31199, ORF4 31232 
to 36067, and ORF5 36249 to 41774. The predicted amino acid sequences of the various peptides encoded by these 
sequences are shown in SEQ ID NO:2, 3, 4, 5, and 6, respectively. 

Yet other preferred DNA molecules of the present invention include the modules that encode the synthase units, 

55 which include all the activities necessary for a single round of synthesis. These are represented by the following nu- 
cleotide residues of SEQ ID NO:1: Starter Module 942 to 3929, Module 1 3993 to 8471, Module 2 8541 to 13970, 
Module 3 1 441 1 to 1 9666, Module 4 201 36 to 2461 1 , Module 5 24675 to 30902, Module 6 31 337 to 35743, and Module 
7 36360 to 40826. The predicted amino acid sequences of the various synthase units encoded by these modules are 
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represented, respectively, by the following amino acid residues: Starter SU 43 to 1038, SU1 1060 to 2552 and SU2 
2576 to 4385 in SEQ ID NO:2; SU3 21 to 1772 in SEQ ID NO:3; SU4 43 to 1534 and SU5 1556 to 3631 in SEQ ID 
NO:4; SU6 36 to 1504 in SEQ ID NO:5; and SU7 38 to 1526 in SEQ ID NO:6. 

Still other preferred DNA molecules include the various submodules that encode the various domains of tylactone 
s synthase. These submodules are represented by the following nucleotide residues: KS°(s) 942 to 2156, AT(s) 2571 
to 3557, ACP(s) 3675 to 3929, KS1 3993 to 5264, AT1 5631 to 6617, KR1 7410 to 7949, and ACP1 8220 to 8471 of 
Module 1 in SEQ ID NO:1; KS2 8541 to 9812, AT2 10260 to 11246, DH2 11319 to 11876, KR2 12861 to 13415, and 
ACP2 13719 to 13970 of Module 2 in SEQ ID NO:1; KS3 14411 to 15697, AT3 16055 to 17122, DH3 17198 to 17794, 
KR3 18584 to 19138, and ACP3 19415 to 19666 of Module 3 in SEQ ID NO:1; KS4 20136 to 21404, AT4 21771 to 

10 22757, KR'4 23541 to 24077, and ACP4 24360 to 24611 of Module 4 in SEQ ID NO:1 ; KS5 24675 to 25949, AT5 26292 
to 27284, DH5 27360 to 27917, ER5 28767 to 29813, KR5 29829 to 30368, and ACP5 30651 to 30902 of Module 5 
in SEQ ID NO: 1 ; KS6 31 337 to 32608, AT6 32975 to 33961 , KR6 34694 to 35236, and ACP6 35492 to 35743 of Module 
6 in SEQ ID NO:1 ; KS7 36360 to 37631 , AT7 37989 to 38987, KR7 39759 to 4031 3, ACP7 40575 to 40826, and TE7 
41235 to 41333 of Module 7 in SEQ ID NO:1. 

*5 The predicted amino acid sequences of the various domains encoded by these submodules are represented, 

respectively, by KS Q (s) 43 to 447, AT(s) 586 to 914, ACP(s) 954 to 1038, KS1 1060 to 1483, AT1 1606 to 1 934, KR1 
2199 to 2378, and ACP1 2469 to 2552 in SEQ ID NO:2; KS2 2576 to 2999, AT2 3149 to 3477, DH2 3502 to 3687, 
KR2 4016 to 4200, and ACP2 4302 to 4385 in SEQ ID NO:2; KS3 21 to 449, AT3 569 to 924, DH3 950 to 1148, KR3 
1 41 2 to 1 596, and ACP3 1 689 to 1 772 in SEQ ID NO:3; KS4 43 to 465, AT4 588 to 91 6. KR'4 1 1 78 to 1 356, and ACP4 

20 1 451 to 1 534 in SEQ I D NO:4; KS5 1 556 to 1 980, ATS 2095 to 2425, DH5 2451 to 2636, KR5 3274 to 3453, and ACP5 
3548 to 3631 in SEQ ID NO:4; KS6 36 to 459, AT6 582 to 910, KR6 1155 to 1335, and ACP6 1421 to 1504 in SEQ ID 
NO:5; KS7 38 to 461 , AT7 581 to 913, KR7 1171 to 1355, ACP7 1443 to 1526, and TE7 1663 to 1695 in SEQ ID NO:6. 

Although not wishing to be bound to any particular technical explanation, sequence similarity exists among domain 
boundaries in various polyketide synthase genes. Thus, one skilled in the art is able to predict the domain boundaries 

25 of newly discovered polyketide synthase genes based on the sequence information of known polyketide synthase 
genes. In particular, the boundaries of submodules, domains, and open reading frames in the instant application are 
predicted based on sequence information disclosed in the instant application and the locations of the domain bound- 
aries of the erythromycin polyketide synthase (Donadio et al., Gene 111 :51 (1992)). 

The DNA sequence of the tylactone synthase gene was determined from recombinant DNA clones prepared from 

30 the DNA of Streptomyces fradiae. The tylactone synthase gene is contained in recombinant DNA vectors pSET506 
and pSET507 (Figure 3), which are available from the National Center for Agricultural Utilization Research, 1815 North 
University Street, Peoria, Illinois 61604-3999, in E. co//K12 MM294 under accession numbers NRRL B-18688 (depos- 
ited July 1 9, 1 990) and NRRL B-1 8689 (deposited July 1 9, 1 990), respectively. 

Tehcniques for isolating bacterial DNA are readily available and well known in the art. Any such technique can be 

55 employed in this invention. In particular, DNA from these cultures is isolated as follows. Lyophils of E. coli K12 
MM294/pSET506 or E. coli K12 MM294/pSET507 are plated onto L-agar (10 g tryptone, 10 g NaCI, 5 g yeast extract, 
and 1 5 g agar per liter) plates containing 100 u.g/ml ampicillin to obtain a single colony isolate of the strain. This colony 
is used to inoculate about 500 ml of L-broth (10 g tryptone, 10 g NaCI, 5 g yeast extract per liter) containing 100 jig/ 
ml ampicillin, and the resulting culture is incubated at 37° C with aeration until the cells reach stationary phase. Cosmid 

*o DNA is obtained from the cells in accordance with procedures known in the art (see e.g., Rao et al., 1987 in Methods 
in Enzymology, 153:166). 

DNA of the current invention can be sequenced using any known techniques in the art such as the dideoxynucle- 
otide chain-termination method (Sanger, et al., Proc. Natl. Acad. Sci. 74:5463 (1977)) with either radioisotopic or flu- 
orescent labels. Double-stranded, supercoiled DNA was used directly for templates in sequence reactions with se- 

45 quence-specific oligonucleotide primers. Alternatively, fragments were used to prepare libraries of either random, over- 
lapping sequences in the bacteriophage M13 or nested, overlapping deletions in a plasmid vector. Individual recom- 
binant DNA subclones were then sequenced with vector-specific oligonucleotide primers. Radioactive reaction products 
were electrophoresed on denaturing polyacrylamide gels and analyzed by autoradiography Fluorescent- labeled re- 
action products were electrophoresed and analyzed on Applied Biosystems (ABI Division, Perkin Elmer, Foster City, 

so CA 94404) model 370A and 373A or DuPont (Wilmington, DE) Genesis DNA sequencers. Sequence data was assem- 
bled and edited using Genetic Center Group (GCG, Madison, Wl) programs GelAssemble and Seqed or the ABI model 
670 Inherit Sequence Analysis system and the AutoAssembler and SeqEd programs. 

Polypeptides corresponding to a domain, a synthase unit, or a multi-functional polypeptide can be produced by 
expression of the cDNA sequence in a bacteria, for example, using known expression vectors. Alternatively, the 

55 polypeptides mentioned above can be extracted from tylactone-producing bacteria such as Streptomyces fradiae. In 
addition, the techniques of synthetic chemistry can be employed to synthesize the polypeptides mentioned above. The 
procedures and techniques for isolation and purification of homogenous protein or polypeptides are well known in the 
art. 



4 



70 



75 



20 



25 



30 



35 



40 



45 



50 



55 



EP 0 791 655 A2 

^^^^ 

EXAMPLE 1 



5 



EP0 791 655 A2 



by the EcoRl site at 4.0 kb and the BamHl site at 7.5 Kb, the fragments bounded by BamH\ sites at 7 5 12 0 23 4 

27.3, 34.6, 39.5, and 50.5 kb. 

In order to obtain the tylG gene on a single DNA fragment, the 31 .6 kb DNA fragment bounded by the EcoRI site 

c !"f th ° & ' nHI Sit9 at 34 6 ** iS isolated ,rom a P 8 ** 3 ' di9estion of pSETSOS with the restriction enzymes 

s EcoRI and BamHl. The 15.9 kb DNA fragment bounded by the BamHl sites at 34.6 and 50.5 kb is isolated from a 
partial d.gestion of P SET507 with the restriction enzyme BamHl. The resulting fragments are ligated and cloned in an 
appropriate recombinant DNA vector. Clones containing the correct orientation of the two ligated fragments are iden- 
tified by restriction enzyme site mapping. 

10 EXAMPLE 2 

Production of a polyketide hybrid of platenolide and tylactone 

The lactone rings of the polyketides platenolide and tylactone undergo an identical set of post-condensation 
processing steps. However, these two polyketides are synthesized by condensation of more than one type of caiboxylic 
acd and the specific building blocks chosen for incorporation in the two pathways differ. The organization of the pla- 
tenolide gene cluster is shown in Figure 4 along with the accompanying synthetic pathway in Figure 5 The specific 
carboxylK acid substrates that are used for each condensation reaction and the post^ondensation activities of plate- 
nolide synthesis are indicated. 

The DNA sequence of the Streptomyces ambofaciens platenolide synthase (srmG) genes was obtained by se- 
quencing inserts of recombinant DNA subclones containing contiguous or overlapping DNA segments, which when 
considered together span the entire srmG region. All sequences representing srmG are fully contained in the overlap- 
ping cosmid clones pKCIOBO and P KC1 306 (Figure 6). The sequence can be obtained by subcloning and sequencing 
the fragments bounded by NnA sites at position 1 , 0.3 kb, 8.2 kb, 14. 1 kb. 20.2 kb, 29.5 kb, 31 .4 kb, 41 1 kb and 42 0 kb 
In order to obtain the srmG region on a single fragment, the 25.0 kb fragment bounded by the NnA site at position 
li and the SM s.te at 25.0 kb is isolated from a partial digestion of pKC1080 with restriction enzymes A/rut and SfiA 
The 17.8 kb DNA fragment bounded by the SfiA sites at 25.0 kb and 42.8 kb is isolated by digestion of pKC1 306 with 
me restnction enzyme Sft/I. The resulting fragments are ligated and cloned in an appropriate recombinant DNA vector 
Clones containing the correct orientation of the two ligated fragments are identified by restriction enzyme mapping 

An exchange of tylactone polyketide synthase DNA with corresponding platenolide DNA was effected as follows 
in order to generate a novel polyketide structure. 

A strain of Streptomyces ambofaciens, the organism that produces platenolide, was constructed with most of ORF 1 
de eted. This ORF1 -deficient strain produced no detectable platenolide. To confirm that the lack of ORF1 was the only 
deficiency in platenolide production, a construct containing ORF1 , and not any functional activities of ORFs 2-5 was 
introduced into the ORF1 -deficient strain on a vector that contains the site-specific integration function from the strep- 
tomycete phage *C31. Integration of ORF1 at the *C31 att site restored spiramycin production to parental levels 
confirming that ORF1 codes for a functional protein and that ORFs 2-5 are expressed in the ORF1 -deficient strain 

ORF1 of tylGis organized like srmGORFI . DNAcoding for a KS\ an AT, and an ACPfrom fy/ORF1 was exchanged 
with the corresponding region from srm ORF1 by replacing an EcoRI-Apat fragment of sm»ORF1 with an EcdRl-SM 
fragment from fy/ORF1 , generating plasmid pKC1 524. When P KC1 524 was introduced into the ORF1 -deficient strain 
polyketide synthesis was restored. The products produced by this restored strain were indistinguishable from those 
produced by the parental strain of Streptomyces ambofaciens when analyzed by thin-layer chromatography and HPLC 
c™ 6 ,!^ ORF1 DNA sequence is SEQ ID NO:7 and the amino acid sequence of the ORF1 -encoded polypeptide 
is SEQ ID NO:8. The rest of the DNA sequence of the hybrid molecule is identical to original tylG sequence and the 
polypeptides encoded by this remaining portion are therefore identical as well. 

In order to simplify physical analysis of the putative novel polyketide, an Xhd fragment encoding sugar synthesis 
or addition was deleted from the restored. Deletion of this fragment led to a strain that produced only the lactone without 
any sugar res.dues added. When analyzed by NMR and mass spectroscopy, the isolated product made by this sugar- 
deficient strain was confirmed as 16-methylplatenolide, which is the structure that would be predicted from the incor- 
poration of a propionate in place of the acetate normally utilized in platenolide synthesis 
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(1) GENERAL INFORMATION: 

75 (i) APPLICANT: ELI LILLY AND COMPANY 

(B) STREET: Lilly Corporate Center 

(C) CITY: Indianapolis 
<D] STATE: Indiana 

(EJ COUNTRY: United States of America 
(F) ZIP: 46285 

20 

(ii) TITLE OF INVENTION: POLYKETIDE SYNTHASE GENES 

(iii) NUMBER OF SEQUENCES: 15 

(iv) CORRESPONDENCE ADDRESS: 

25 (A) ADDRESSEE: K. G. TAPPING 

(B) STREET: Erl Wood Manor 

(C) CITY: Windlesham 

(D) STATE: Surrey 

(E) COUNTRY: United Kingdom 

(F) ZIP: GU20 6PH 

30 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: Macintosh 

(C) OPERATING SYSTEM: Macintosh 7.0 

(D) SOFTWARE: Microsoft Word 5.1 

35 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEM3TH: 43280 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

50 iix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 816.. 14234 

(ix) FEATURE: 
55 (A) NAME/KEY: CDS 

(B) LOCATION: 14351.. 19945 
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15 



20 



CGC GGT ATG CCC ACC CGT CAG GAC GAC GGC ACC TGG COG GCC GCC CTG 1073 
Arg Gly Met Pro Thr Arg Gin Asp Asp Gly Thr Trp Arg Ala Ala Leu 
5 75 80 8S 

GAG GAC CAC GCC GGC TTC GAC GCC GGC TTC TTC GGC ATG AAC GCC CGG 1191 
Clu Asp His Ala Gly Phe Asp Ala Gly Phe Phe Gly ™ £Z £a S 
90 95 100 

, 0 CAC CCC CCC GCC ACC GAC CCG CAG CAC CGA CTG ATG CTG GAA CTC GGA 

Cln Ala Ala Ala Thr Asp Pro Cln His Arg Leu Met Leu Glu Leu Gly 
105 110 H5 * 

TCC GAG GCA CTG GAG GAC GCC GGC ATC CTC CCC GGC GAT CTC ACC GGC 
Trp Glu Ala Leu Glu Asp Ala Gly lie Val Pro Gly Asp Leu Thr Gly 
120 125 130 

ACC CAC ACC GGA CTC TTC GCC GCC CTG CCG TCC GAC GAC TAT GCC GTT 
Thr Asp Thr Gly Val Phe Ala Gly Val Ala Ser Asp Asp Tyr £ Sal 
b 140 145 150 

CTC ACC CGC CGT TCC GCC CTC TCC GCC GGG GGA TAC ACC GCC ACC GGG 
Leu Thr Arg Arg Ser Ala Val Ser Ala Gly Gly Tyr Thr Ala Thr S£ 
155 160 165 

S »f* ^ ^ C ^ ^ C ^ AAC CGC CTC TCC CAC TTC CTG GGC CTG 
Leu His Arg Ala Leu Ala Ala Asn Arg Leu Ser His Phe Leu Cly Leu 
170 "5 is© 

CGC GGC CCC AGC CTG GTC CTC GAC TCC GCC CAG TCC GCC TCA CTC GTG 
Arg Gly Pro Ser Leu Val Val Asp Ser AU Gin Ser Ala Ser Leu Val 

30 18!> 190 

S E 2? SI ~ ™ ™ «» « CGG GGT GAG ACG TCC CTC 

Gly 
210 

^^^^^^^ CWATC ^*^<^GAGAGCACCACC 
Ala Val Ala Gly Gly Val Asn Leu lie Leu Thr Glu Glu X £ J£ 

220 225 230 



35 



SO 



Aia vai Gin 71., rr: rs x, t ™ m 000 ^ ^ acg crc 

Ala Val Gin Leu Ala Cys Glu Ser Leu Arg Arg Gly Glu Thr Ser Leu 

^uu 205 



1169 



1217 



1265 



1313 



1361 



1409 



1457 



1505 



~ «« „„ to , jj oly „. ^- ss^sssissss 

235 240 245 

TTC CAC GCC CGC GCC AAC GGC TAC GTA CGC GGC GAG CGC CCC GGA GCC 
Phe Asp Ala Arg Ala Asn Gly Tyr Val Arg Cly Clu Gly Gly Gly S 
45 255 260 

Val vTl 7^ f*° o° A GAC 000 GCA 010 ^ 0X0 000 GAC CGC GTC 

Val Val Leu Lys Pro Leu Asp Ala Ala Leu Ala Asp Gly Asp Arg Val 
265 270 27S 

TAC TCC GTC ATC AAG GGA GGT GCC GTC AAC AAC GAC GGC GGC GGC GCG 
Tyr Cys Val lie Lys Gly Gly Ala Val Asn Asn Asp SJ SJ gj 2 

285 290 

S 2 JS J? f ° ? AC ^ GAG 000 00 GAA 007 GTC CTC CGC CAG 174S 
Ser Leu Thr Thr Pro Asp Arg Glu Ala Gin Glu Ala Val Leu Arg Gin 

300 305 310 



1601 



1649 



1697 
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GCC TAC COG COG GCG GGC CTC AGC ACC GGC GCC CTC CGC TAC GTC GAG 
Ala Tyr Arg Arg Ala Gly Val Ser Thr Gly Ala Val Arg Tyr Val Glu 
315 320 32S 

CTG CAC GOG ACC GGC ACC CGG GCC GCC GAC CCC GTC GAG GCG GCC CCA 
Leu His Gly Thr Gly Thr Arg Ala Gly Asp Pro Val Glu Ala Ala Ala 
330 335 340 

ctgcxx:gccgtgctcggggcgggggcggacagcggccgcagcacgccg 

Leu Gly Ala Val Leu Gly Ala Gly Ala Asp Ser Gly Arg Ser Thr Pro 
345 350 355 

CTC GCC GTC GGC TCG GTC AAG ACC AAC GTC GGC CAT CTG GAG GGC GCG 
Leu Ala Val Gly Ser Val Lys Thr Asn Val Gly His Leu Glu Gly Ala 
360 365 370 

CCC GGC ATC CTC CCA CTC ATC AAG GCC ACG CTC TCC GTA CGG AAG GGC 
Ala Gly lie Val Gly Leu lie Lys Ala Thr Leu cys Val Arg Lys Gly 
375 380 385 390 

GAA CTC GTC CCC AGC CTC AAC TTC AGC ACG CCG AAC CCT GAC ATC CCC 
Glu Leu Val Pro Ser Leu Asn Phe Ser Thr Pro Asn Pro Asp He Pro 
395 400 405 

CTC GAC GAC CTC CGG CTC CGC GTC CAG ACC GAA CGG CAG GAG TOG AAC 
Leu Asp Asp Leu Arg Leu Arg Val Gin Thr Glu Arg Gin Glu Trp Asn 
410 415 420 

GAG GAG GAC GAC CGG CCG CGC GTG GCC GCC GTC TCC TCC TTC GGT ATG 
Glu Glu Asp Asp Arg Pro Arg Val Ala Gly Val Ser Ser Phe Gly Met 
*25 430 435 

GGC GGA ACC AAT GTC CAC CTC GTC ATC GCG GAG OCT CCG GCC GCC GCG 
Gly Gly Thr Asn Val His Leu Val lie Ala Glu Ala Pro Ala Ala Ala 
440 445 450 

GOG TCC TCC CGC CCG GGG GGT TCC GGC GCT GGT TCC GGT GCC GGT ATC 
Gly Ser Ser Gly Ala Gly Gly Ser Gly Ala Gly Ser Gly Ala Gly S 
,S5 460 465 470 

Ser Ala Val Ser Gly Val Val Pro Val Val Val Ser Gly Arg SeT Arg 
475 480 485 

Val Val Val Arg Clu Ala Ala Gly Arg Leu Ala Clu Val Val Glu Ala 
490 495 500 

2f ,7? 0(30 01X5 000 0X7 01X3 000 GTS ACG ATC GCG GAC CGG TCG 

Cly Gly Val Gly Leu Ala Asp Val Ala Val Thr Met Ala Asp Arg Ser 
b05 510 515 

At-!t Ph" 1 " < r^ G CTG GCT CGG GGT GAG GAG CTT 

Arg Phe Gly Tyr Arg Ala Val Val Leu Ala Arg Gly Glu Ala Glu Leu 

wo 525 530 

GCC GGG CGT TTC CCG GCG TTC GCG GGG GGT GAT CCG GAC GCG GGT GTC 
Ala Gly Arg Leu Arg Ala Leu Ala Cly Cly Asp Pro Asp Ala Gly Val 
540 545 550 
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GTC ACC GCT GCG GTT CTC GAC OCT OCT GTG GTT GTC GGT GCT GCC CCC 
Val Thr Gly Ala Val Leu Asp Gly Gly Val Val Val Gly Ala Ala Pro 
555 560 565 

GGC OCT GCC GGT GCT GCC GCT OGT GCC GGT GCT GCC GCT GGT GCC GCT 
Gly Gly Ala Gly Ala Ala Gly Gly Ala Gly Ala Ala Gly Gly Ala Gly 
570 575 580 

GGT GGG GGC GTG GTG TTC GTT TTC CCT GGT CAG GGG ACG CAG TGG GTC 
Gly Gly Gly Val Val Leu Val Phe Pro Gly Gin Gly Thr Gin Trp Val 
585 590 595 

GGG ATG GGT GCG GGG CTG CTG GGG TCT TCG GAG GTG TTT GCG GCG TCG 
Gly Met Gly Ala Gly Leu Leu Gly Ser Ser Glu Val Phe Ala Ala Ser 
600 605 610 

ATG CGG GAG TGT GCG CGG GCG CTG ACT GTT CAT GTG GGG TGG GAT TTG 
Met Arg Glu Cys Ala Arg Ala Leu Ser Val His Val Gly Trp Asp Leu 
615 620 625 630 

CTG GAG GTG GTG TCG GGC GGG GCC GGG TTG GAG CGG GTC GAT GTC GTC 
Leu Glu Val Val Ser Gly Gly Ala Gly Leu Glu Arg Val Asp Val Val 
635 640 645 

CAG CCG GTG ACG TCG GCG GTG ATG GTC TCG CTC GCC CGG TAC TGG CAG 
Gin Pro Val Thr Trp Ala Val Met Val Ser Leu Ala Arg Tyr Trp Gin 
650 655 660 

GCG ATG GCT GTC GAC GTC GCT GCG GTC GTC GGT CAT TCC CAG GGG GAG 
Ala Met Gly Val Asp Val Ala Ala Val Val Gly His Ser Gin Gly Glu 
665 670 675 

ATC GCC GCT GCC ACG GTC GCG GGG GCG TTC TCG CTC GAG GAT GCG GCG 
lie Ala Ala Ala Thr Val Ala Gly Ala Leu Ser Leu Glu Asp Ala Ala 
680 685 690 



GCT GTC GTC GCT CTC CGG GCG GGG TTC ATT GGC CGG TAT CTC GCG OCT 
Ala Val Val Ala Leu Arg Ala Gly Leu He Gly Arg Tyr Leu Ala Gly 
700 705 710 



CCT OCT GCG ATC GCG GCT GTT CCG CTC CCT CCC GGC GAG GTC GAG GCC 
Arg Gly Ala Met Ala Ala Val Pro Leu Pro Ala Gly Glu Val Glu Ala 
715 720 725 

GGG CTC GCG AAG TCG CCG GCT GTC GAG GTC GCG GCG GTC AAC GGT CCG 
Gly Leu Ala Lys Trp Pro Gly Val Glu Val Ala Ala Val Asn Gly Pro 
730 735 740 

GCG TCT ACG GTC GTT TCC GGG GAT CGG CGG GCG GTC GCC GGT TAT GTC 
Ala Ser Thr Val Val Ser Gly Asp Arg Arg Ala Val Ala Gly Tyr Val 
745 750 755 

GCC GTC TCT CAG GCG GAG GCT GTC CAG GCC CGG TTC ATA CCG GTC GAC 
Ala Val Cys Gin Ala Glu Gly Val Gin Ala Arg Leu He Pro Val Asp 
760 765 770 

TAC GCC TCT CAC TCC CGC CAT GTG GAG GAC CTC AAG GGC GAG TTC GAG 
Tyr Ala Ser His Ser Arg His Val Glu Asp Leu Lys Gly Glu Leu Glu 
77 5 780 785 790 
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nil 9 9X0 ** g Ser Pro v «l Pro Val Cys 

795 800 80S 



S fS £5 ??* 2° ^ CCG 000 "O «C GAT GCG GOG 

Ser Thr V«l Ala Gly Glu Gin Pro Gly Glu Pro Val Phe Asp Ala Gly 

815 820 

T^SZ^f^^^^^^^^GCGGTGCTC 
I2I Arg A « n V.1 Glu Phe Ser Ala Sal Val 

830 835 



845 850 

865 870 

5 880 885 

His Ara I^n ^° TCC ACC «* GAG GCC TGG CCC CAC GGC GCC ACC 

taU JjJ Ser «« Ala Glu Ala Trp AU His S£ Sa t£ 

895 900 

910 915 

925 930 

945 950 



960 965 

975 980 
990 995 

E 2£ S S S Su 2 S f c AGG 5*° CTC ACC GAG Acc 

1000 y iooc* 19 Asn **» Leu Uu Th * Glu Thr Gly 

1005 1010 
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CTG GCG CAG TTC CTC CAC ACC GAG TTG CTC GAC GGC TCC CCC TCR CRT 
Leu Ala Gin Phe Leu His Thr Glu Leu Leu Asp cly Ser Pro S c3S 
103S 1040 104 s 

TCC GTC CTC GCA CCG GCG CAG AAG AGC TTC GAA GCC CAG GAC CCG ATT 
Ser V.1 Leu Ale Pro Ala Cln Ly. Ser Phe Glu Ala Gin Clu J£ S 
1050 1055 1060 

GCG GTG CTG OCT ATG GCG TGC CGC TTC CCC OCT GGG GTC OCT TCC CCG 
Ala v.l Val Gly Met Gly Cys Arg Phe Pro Cly Gly Val Gly Ser Pro 
1065 1070 1075 

GAG GCG TTG TGG CGG TTG GTG GTG GAG GGG GTG GAC GCG GTT TCC CCG 
Glu Ala Leu Trp Arg Leu v.l Val Glu Gly Val Asp Ala Val Ser Pro 
1080 1085 1090 

TTT CCC GGT GAT CGT GGC TGG GAT GTG GAG GGG TTG TAC GAC CCG GAG 
Phe Pro Gly Asp Arg Gly Trp Asp Val Glu Gly Leu £r Sp So oTu 
5 1100 1105 1110 

CCG GOT GTC CCG GGG AAG TCG TAT GTC CGG GAG GGG GOT TTT CTG CAT 
Pro Gly Val Ala Gly Lys Ser Tyr Val Arg Glu Gly G^ Phe Leu His 
1115 H20 1125 

GAT GCG GCG GAG TTC GAT GCG GAG TTC TTC GGG ATT TCG CCG CGT GAG »i. 
Asp Ala Ala Glu Phe Asp Ala Glu Phe Phe Gly l£ Ser £o 2£ Glu 
H30 ins 1140 



20 



35 



50 



r .." 7 lro CGG GGT GGC GAG TGT GAC ATT. av 

Leu His Leu Ala Val Gin Ala Leu Arg Gly Gly Glu CyT J£ ™ £a 
v 1245 



3953 



4001 



4049 



4097 



4145 



4193 



4289 



4337 



4385 



??° ^T? ?f° ATC ° AT 000 CAG CAG CGG CTC TO CTC GAG ACC TCC TCG 
Val AlaMet Asp Pro Gin Gln^Arg Leu Leu Leu S£ J£ Sr ^ 

GU> ^° ™ «« CGG GCG GGT ATC GAC CCG CAC TCG CTC CAC GGC AGC 
Glu Ala lie Glu Arg Ala Gly He Asp Pro His Ser Leu His S£ J« 
iOV 1165 1170 

Ara Thr GIv ns^ ^ ATC CCG CAG GAA TAC CCT 

Arg_l*r Gly Val lyr Ala Gly Val Met Pro Gin Glu Tyr Gly Pro Arg 

1185 1190 

1195 1200 12 05 

TCG GGG ACT GTG GTG TCG GGG CGT GTC GCC TAC ACG CTG GGG CTG GAG 
Ser Gly Ser Val Val Ser Gly Arg Val Ala Tyr Thr Leu C^ C^ 
4S 1210 1215 1220 

COT CCC GCC GTC ACC GTG GAT ACG GCG TGT TCG TCG TCG TTG CTC CCC 
Cly Pro Ala Val Thr Val Asp Thr Ala Cys Ser Ser Sr £u £? X 

1230 X235 

£u ££ S H S3? ^ ?f= ?^ CGG GGT GGC GAG TGT GAC ATG GCG 

1250 



4481 



4529 



4577 



TTC GCC GGT GGT GTG ACG GTC ATC GCC GGG CCG GGG ATC TTC GTC GAG azk 
Leu Ala Gly Gly Val Thr Val Met Ala Gly Pro Gly £ SS 

1260 1265 12?0 
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TTTTCGCGGCAGCGCGGGTTCGCGCKCGATGGGCG^ 4673 
Phe Ser Arg Gin Arg Gly Leu Ala Ala Asp Gly Arg Cys Lys Ala Phe 
1275 1280 1285 

GCG GAT GGG GCG GAT GGG ACC GCT TCG GCC GAG GGT GCG GGG GTC GTC 4721 
Ala Asp Gly Ala Asp Gly Thr Ala Trp Ala Glu Gly Ala Gly Val Val 
1290 1295 1300 

10 CTC GTC GAG COG TTC TCG GAT GCC CGG CGG TTG GGG CAT CCG GTC TTC 4769 

Leu Val Glu Arg Leu Ser Asp Ala Arg Arg Leu Gly His Pro Val Leu 
1305 1310 1315 

GCG GTC GTC TCT GGG TCG GCG GTC AAT CAG GAC GGT GCG TCG AAT GGT 4817 
Ala Val Val Cys Gly Ser Ala Val Asn Gin Asp Gly Ala Ser Asn Gly 
75 1320 1325 1330 

TTC ACG GCG CCG ACT GGT CCG TCG CAG GAG CGG GTC ATT CGT CAG GCG 4865 
Leu Thr Ala Pro Ser Gly Pro Ser Gin Glu Arg Val He Arg Gin Ala 
1335 1340 1345 1350 

TTC GGG AAT GCG CGG TTC ACG GTC GCG GAT GTC GAT GTC GTC GAG GCG 4913 
Leu Gly Asn Ala Arg Leu Thr Val Ala Asp Val Asp Val Val Glu Ala 
1355 1360 1365 

CAT GGG ACG GGG ACG CGG CTC GGT GAT CCG ATC GAG GCG CAG GCG TTC 4961 
His Gly Thr Gly Thr Arg Leu Gly Asp Pro lie Glu Ala Gin Ala Leu 
1370 1375 1380 

CTC GGG ACG TAT GGG CGG GAT CGT GAT GGT GGG CGT CCG GTC TOG TTC 
Leu Gly Thr Tyr Gly Arg Asp Arg Asp Gly Gly Arg Pro Val Trp Leu 
30 1385 1390 1395 

GGG TCG TTC AAG TCG AAT ATT GGT CAT GCT CAG GCG GCT GCG GGG GTC 5057 

7 LyS Ser Asn Ile His Ala Gin Ala Ala Ala Gly Val 

1400 1405 1410 

at^ JT"? AAG ATC GTG ™* 000 ATG 000 TAT GGG 7(30 V ^ G 000 5105 

Ala Gly Val Ile Lys Met Val Leu Ala Met Arg Tyr Gly Trp Leu Pro 

1415 1420 1425 1430 

CGG ACG TTC CAT CTC GAT GAG CCG AGC CGG CAT GTC GAC TCG TCG GCT 5153 
Arg Thr Leu His Val Asp Glu Pro Ser Arg His Val Asp Trp Ser Ala 
1435 1440 1 445 

GGT GGT GTC TCG TTC CTC ACC GAG GCG CGG GAG TCG CCG GGG GTC GAC 5201 
Gly Gly Val Trp Leu Leu Thr Glu Ala Arg Glu Trp Pro Gly Val Asp 
1450 1455 1460 

CGG CCG CCT CCG CCG GCG GTC TCC GCC TTT GGT GTC ACT GGT ACC AAC 5249 
Arg Pro Arg Arg Ala Ala Val Ser Ala Phe Gly Val Ser Gly Thr Asn 
1465 1470 1475 

GCC CAT CTC ATC CTC GAA GCC CCC GAC ACC GCC GAG GCG GAG AGC GCC 5297 
Ala His Leu Ile Leu Glu Ala Pro Asp Thr Ala Glu Ala Glu Ser Ala 
1480 1485 1490 

ACG ACC CCG GTC CGC TCT GAG GTC TCG GAG TCT GCT GCG GTC CTC GAT 5345 
55 Thr Thr Pro Val Arg Ser Glu Val Ser Glu Ser Ala Ala Val Leu Asp 

1495 1500 1505 15 lo 
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GCC CGC AGT GGT GTG GTC CCG GTG GTG GTT TOG GGG CGT TCG CGG GTG 5393 
Ala Arg Ser Gly Val Val Pro Val Val Val Ser Gly Arg Ser Arg Val 
1515 1520 1525 

GTG GTG CGG GAG GCT GCG GGC CGG TTG GCG GAG GTG GTG GAG GCC GGT 5441 
Val Val Arg Glu Ala Ala Gly Arg Leu Ala Glu Val Val Glu Ala Gly 
1530 1535 1540 

GGT GTG GGG CTG GCG GAT GTG GCG GTG ACG ATG GCG GGC CGG TCG CGG 5489 
Gly Val Gly Leu Ala Asp Val Ala Val Thr Met Ala Gly Arg Ser Arg 
1545 1550 1555 



TTT GGG TAT CGG GCG GTT GTG CTG GCT CGG GGT GAG GCT GAG CTT GCC 5537 
Phe Gly Tyr Arg Ala Val Val Leu Ala Arg Gly Glu Ala Glu Leu Ala 
15 1560 1565 1570 

GGG CGT TTG CGG GCG TTG GCG GGG GGT GAT CCG GAC GCG GGT GTG GTC 5585 
Gly Arg Leu Arg Ala Leu Ala Gly Gly Asp Pro Asp Ala Gly Val Val 
1575 1580 1585 1590 

20 

ACC GGT GCG GTG GTG GAC CCG GAG ACG GGG TCC GGT GGT GGG GGC GTG 5633 
Thr Gly Ala Val Val Asp Pro Glu Thr Gly Ser Gly Gly Gly Gly Val 
1595 1600 1605 

GTG TTG GTT TTC CCT GGT CAG GGG ACG CAG TGG GTG GGG ATG GGT GCG 5681 
Val Leu Val Phe Pro Gly Gin Gly Thr Gin Trp Val Gly Met Gly Ala 
1610 1615 1620 

GGG CTG CTG GGG TCT TCG GAG GTG TTT GCG GCG TCG ATG CGG GAG TGT 5729 
Gly Leu Leu Gly Ser Ser Glu Val Phe Ala Ala Ser Met Arg Glu Cys 
30 1625 1630 1635 

GCG CGG GCG CTG AGT GTT CAT GTG GAG TGG GAT TTG CTG GAG GTG GTG 5777 
Ala Arg Ala Leu Ser Val His Val Glu Trp Asp Leu Leu Glu Val Val 
1640 1645 1650 

TCG GGC GGG GCC GGG TTG GAG CGG GTG GAT GTG GTG CAG CCC GTG ACG 5825 
Ser Gly Gly Ala Gly Leu Glu Arg Val Asp Val Val Gin Pro Val Thr 
1655 1660 1665 1670 

TGG GCG GTG ATG GTG TCG CTG GCC CGG TAC TGG CAG GCG ATG GGT GTG 5873 
40 Trp Ala Val Met Val Ser Leu Ala Arg lyr Trp Gin Ala Met Gly Val 

1675 1680 1685 

* GAC GTG GCT GCG GTG GTG GGT CAT TCC CAG GGG GAG ATC GCT GCT GCC 5921 
Asp Val Ala Ala Val Val Gly His Ser Gin Gly Glu He Ala Ala Ala 
1690 1695 1700 



ACG GTG GCG GGG GCG TTG TCG CTG GAG GAT GCG GCG GCT GTG GTC GCT 5969 
Thr Val Ala Gly Ala Leu Ser Leu Glu Asp Ala Ala Ala Val Val Ala 
1705 1710 1715 



50 CTG CGG GCG GGG TTG ATT GGC CGG TAT CTG GCG GGT CGT GGT GCG ATC 

Leu Arg Ala Gly Leu He Gly Arg Tyr Leu Ala Gly Arg Gly Ala Met 
1720 1725 1730 



6017 



GCG GCT GTT CCG CTG CCT GCC GGC GAG GTC GAG GCC GGG CTG GCG AAG 6065 
55 Ala Ala Val Pro Leu Pro Ala Gly Glu Val Glu Ala Gly Leu Ala Lys 

1735 1740 1745 1750 
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TGG COG GGT GTC GAG GTC GCG GCG GTC AAC GGT CCG GOG TCC ACG GTC 6113 
Trp Pro Gly Val Glu Val Ala Ala Val Asn Gly Pro Ala Ser Thr Val 
1755 1760 1765 

GTT TCC GGG GAT CGG CGG GCG GTG GCC GGT TAT GTC GCC GTC TCT CAG 6161 
Val Ser Gly Asp Arg Arg Ala Val Ala Gly Tyr Val Ala Val Cys Gin 
1770 1775 1780 

GCG GAG GGT GTC CAG GCT CGG TTC ATA CCG GTC GAC TAC CCC TCT CAC 6209 
Ala Glu Gly Val Gin Ala Arg Leu lie Pro Val Asp Tyr Ala Ser His 
1785 1790 1795 



6305 



6353 



TCC CGC CAT GTC GAG GAC CTG AAG GGC GAG TTC GAG CGG GTC CTG TCC 6257 
Ser Arg His Val Glu Asp Leu Lys Gly Glu Leu Glu Arg Val Leu Ser 
,5 1800 1805 1810 

GGT ATC CGC CCC CGC AGT CCG CGG GTC CCG GTC TCT TCC ACC GTC GCC 
Gly lie Arg Pro Arg Ser Pro Arg Val Pro Val Cys Ser Thr Val Ala 
1815 1820 1825 1830 

GGA GAG CAG CCG GGC GAG CCG GTT TTC GAT GCG GGG TAT TCG TTC CGT 
Gly Glu Gin Pro Gly Glu Pro Val Phe Asp Ala Gly Tyr Trp Phe Arg 
1835 1840 1845 

AAT CTG CGG AAC CGG GTT GAG TTC TCC GCG GTG GTC GGT GGT TTC TTC 6401 
Asn Leu Arg Asn Arg Val Glu Phe Ser Ala Val Val Gly Gly Leu Leu 
1850 1855 I860 

GAG GAG GGC CAC CGT CGG TTC ATC GAG GTC AGT GCC CAC CCG GTA CTC 6449 
Glu Glu Gly His Arg Arg Phe He Glu Val Ser Ala His Pro Val Leu 
1865 1870 1875 

GTC CAT GCG ATC GAG CAG ACG GCC GAG GCC GCG GAC CGG AGT GTC CAT 6497 
Val His Ala He Glu Gin Thr Ala Glu Ala Ala Asp Arg Ser Val His 
1880 1885 1890 

GCC ACC GGG ACC CTC CGC CGC CAG GAC GAC AGC CCG CAC CGC CTC CTC 6545 
Ala Thr Gly Thr Leu Arg Arg Gin Asp Asp Ser Pro His Arg Leu Leu 
1895 1900 1905 1910 

ACC TCC ACC GCC GAG GCC TGG GCC CAC GGC GCC ACC CTC ACC TCG GAC 6593 
40 Thr Ser Thr Ala Glu Ala Trp Ala His Gly Ala Thr Leu Thr Trp Asp 

1915 1920 1925 

CCC GCC CTC CCC CCA GGC CAC CTC ACC ACC CTC CCC ACC TAC CCC TTC 6641 
Pro Ala Leu Pro Pro Gly His Leu Thr Thr Leu Pro Thr Tyr Pro Phe 
I 930 1935 1940 



6689 



AAC CAC CAC CAC TAC TCG CTC GAC ACC ACC CCC ACC ACC CCC GCG ACG 
Asn His His His Tyr Trp Leu Asp Thr Thr Pro Thr Thr Pro Ala Thr 
1945 1950 1955 

ACC ACC CAG AGC CCC ACC GAT GCC TCG CGC TAC CGC GTC ACC TCG AAA 6737 
Thr Thr Gin Ser Pro Thr Asp Ala Trp Arg Tyr Arg Val Thr Trp Lys 
I960 1965 1970 y 

GCC CTC ACC GAA TCC TCC CCC GTC CGC CCT CAC TCC ATC GGT CGC TCC 6785 

55 ™ L * U Thr Glu Ser Ser Pro Val Ar 9 Pro His Ser He Gly Arg Cys 

1975 1*80 1985 1990 
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CTC CTC GTT GCA CCC CCG ACC ACC CAC GGC GAG CTC CTC GAC OGA CTG 
Leu Leu Val Ala Pro Pro Thr Thr Asp Gly Glu Leu Leu Asp Gly Leu 
1995 2000 2005 

ACA ACG GTG TTG TCC GAG CGC OCT GCC TCC GTC GCC CCC CTT GAG CTG 
Thr Thr Val Leu Ser Glu Arg Gly Ala Ser Val Ala Arg Leu Glu Val 
2010 2015 2020 

CCC ATC GGC CCG CCG CGT GCC GAG GTC GCC GAA CTG CTC AAG CCC TCC 
Pro He Gly Ala Arg Arg Ala Glu Val Ala Glu Leu Leu Lys Pro Ser 
2025 2030 2035 

ATG GAG TCA GCG GGG GAG GAG AAC ACC ACC GTC GTC TCC CTT CTC GGT 
Met Glu Ser Ala Gly Glu Glu Asn Thr Thr Val Val Ser Leu Leu Glv 
2040 2045 2050 

CTG GTC CCC TCC ACG GAC GCG GTC AGG ACG TCC ATA GCC CTC CTC CAG 
Leu Val Pro Ser Thr Asp Ala Val Arg Thr Ser lie Ala Leu Leu Gin 
2055 2060 2065 2070 

GCC GTC TCC GAC ATC GCC GTC CCC GCC GCC AGG GTC TOG GCG CTG ACC 
Ala Val Ser Asp lie Gly Val Pro Ala Ala Arg Val Trp Ala Leu Thr 
20? 5 2080 2085 

CGG AGG GCC GTG GCC GTG GTT CCC GGG GAG ACG CCG CAG GAC GCG GGG 
Arg Arg Ala Val Ala Val Val Pro Gly Glu Thr Pro Gin Asp AU S 
2090 2095 2100 

of 7* ^ ^ TO °°* 01X1 00(5 «* GAA CTC CCG GAT 

Ala Gin Leu Trp Gly Phe Gly Arg Val Ala Ala Leu Glu Leu Pro Asp 

2105 2110 2115 

ATC TCC GGC CCC TTG ATC GAT CTG CCG GAG ACA GCG GAG CTG ACG CGG 
lie Trp Gly Gly Leu lie Asp Leu Pro Glu Thr Ala Glu Leu tS 
2120 2125 2130 

ACG CCG GAG ACC TCA CAG CCC CCA CAG ACC CCG GAC AGG CTG CCC CAC 
Thr Pro Glu Thr Ser Gin Pro Pro Gin Thr Pro Glu Arg Z £ §£ 
2135 2140 2145 2150 

ACT CCG AAC CCA CGC GCC CTT GAG CTT GCT GCC GCC GTC CTC GCC GGC 
Thr Pro Asn Arg Arg Ala Leu Glu Leu Ala Ala Ala Val 2S K 
2155 2160 2165 

f C GGC GAG GAC CAG GTC GCC GTG CGC GCC TCG GGG ATC TAC GGG 
Arg Asp Gly Glu Asp Gin Val Ala Val Arg Ala Ser Gly He Tyr Gly 
2170 2 175 2180 

I ™ ^ r° CGG GCC GCG GCG GGG GCC GCC TCC TCC CAG CCG 

Arg Arg Val Ser Arg Ala Ala Ala Ala Gly Ala Ala Ser Trp Gin Pro 
* 1Hb 2190 2195 

TCC GGC ACG GTG CTG ATC ACC GGC GGC ATG GGT GCC ATC GGC AGG CGG 
Ser Gly Thr Val Leu lie Thr Gly Gly Met Gly Ala Xli SJ £ 5 
2200 2205 2210 

CTC GCC CGC AGG CTG GCG GCC GAG GGA GCC GAA CGC CTG GTC CTC ACC 
Leu Ala Arg Arg Leu Ala Ala Glu Gly Ala Glu Arg Leu Val Leu Jhr 
" 15 2220 2225 2230 



6833 



6881 



6929 



6977 



7025 



7073 



7121 



7169 



7217 



7265 



7313 



7361 



7409 



7457 



7505 
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AGC CGT CGC GGA CCG GAG GCG CCG GGC GCC GCC GAA CTC GCC GAG GAA 
Ser Arg Arg Gly Pro Glu Ala Pro Gly Ala Ala Glu Leu Ala Glu Glu 
2235 2240 2245 

CTG CGA GGA CAT GGC TGC GAG GTC GTC CAC GCG GCC TGT GAC GTC GCC 
Leu Arg Gly His Gly Cys Glu Val Val His Ala Ala Cys Asp Val Ala 
2250 2255 2260 

GAG CGT GAT GCG CTC GCC GCG CTC GTC ACC GCG TAT CCG CCG AAC GCC 
Glu Arg Asp Ala Leu Ala Ala Leu Val Thr Ala Tyr Pro Pro Asn Ala 
2265 2270 2275 

GTC TTC CAC ACC GCC GGG ATT CTG GAC GAC GCG GTG ATC GAC ACG CTG 
Val Phe His Thr Ala Gly He Leu Asp Asp Ala Val He Asp Thr Leu 
2280 2285 2290 

TCA CCG GAG AGC TTC GAG ACC GTC CGC GGG GCG AAG GTC TGC GGC GCG 
Ser Pro Glu Ser Phe Glu Thr Val Arg Gly Ala Lys Val Cys Gly Ala 
2295 2300 2305 2310 

GAG CTG CTG CAC CAA CTG ACT GCG GAC ATA AAA GGG TTC GAC GCC TTC 
Glu Leu Leu His Gin Leu Thr Ala Asp lie Lys Gly Leu Asp Ala Phe 
2315 2320 2325 

GTC CTC TTC TCC TCC GTC ACC GGC ACA TCG GGC AAC GCC GGC CAG GGT 
Val Leu Phe Ser Ser Val Thr Gly Thr Trp Gly Asn Ala Gly Gin Gly 
2330 2335 2340 

GCG TAC GCC GCC GCC AAC GCC GCG CTC GAC GCC CTC GCC GAG CGT CGC 
Ala Tyr Ala Ala Ala Asn Ala Ala Leu Asp Ala Leu Ala Glu Arg Arg 
2345 2350 235S 

CGT GCC GCC GGA CTC CCC GCG ACC TCC GTC GCC TCG GGC CTT TCG GGC 
Arg Ala Ala Gly Leu Pro Ala Thr Ser Val Ala Trp Gly Leu Trp Glv 
2360 2365 2370 

GGG GGA GGC ATC GCG GCG GGT GCG GGC GAG GAG AGT CTC TCG CGG CGA 
Gly Gly Gly Met Ala Ala Gly Ala Gly Glu Glu Ser Leu Ser Arg Arg 
2375 2380 2385 2390 

GGG CTC CGG GCC ATC GAC CCC GAC GCG GCC GTC GAC GCG CTC CTC GGC 
Gly Leu Arg Ala Met Asp Pro Asp Ala Ala Val Asp Ala Leu Leu Gly 
2395 2400 2405 

GCC ATC GGC AGG AAC GAC GTC TCC GTC ACT GTC GTC GAC GTC GAC TCG 
Ala Met Gly Arg Asn Asp Val Cys Val Thr Val Val Asp Val Asp Trp 
2410 2415 2420 

GAG CGT TTC GCG CCC GCG ACG AAC GCC ATC CGT CCC GGG CGG CTC TTC 
Glu Arg Phe Ala Pro Ala Thr Asn Ala He Arg Pro Gly Arg Leu Phe 
2425 2430 2435 

GAC ACC GTC CCG GAG GCG CGG GAG GCC CTC ACG GCA CCC GGC ACC ACG 
Asp Thr Val Pro Glu Ala Arg Glu Ala Leu Thr Ala Ala Gly Thr Thr 
2440 2445 2450 

TCC GCG ACG CCG GAC GGC GCG CCC GAG CTG GCG CGG CGG TIG TCC ATC 
Ser Ala Thr Pro Asp Gly Ala Pro Glu Leu Ala Arg Arg Leu Ser Met 
2455 2460 2465 2470 



18 



EP 0 791 655 A2 



CTG AAC GAG ACC GAA CGC CTG COG AAG CTG GTC GAA CTC GTC CGT ACC 8273 
Leu Asn Glu Thr Glu Arg Leu Arg Lys Leu Val Glu Leu Val Arg Thr 
2475 2480 2485 

5 

GAG GCG GCC TTT GTG CTG CGG CAT CCG AAC ACG GAC GCC ATC GGC GCC 8321 
Glu Ala Ala Phe Val Leu Arg Hie Pro Asn Thr Asp Ala lie Gly Ala 
2490 2495 2500 

w GAA CGC CCG TTC AAG TCG GCC GGT TTC GAC TCC CTG ACC TCC CTG GAA 8369 

Glu Arg Pro Phe Lys Ser Ala Gly Phe Asp Ser Leu Thr Ser Leu Glu 
2505 2510 2515 

CTC CGC AAC CGC CTC AAT GCC GGC ACA GGC CTG AAG CTA CCC GCC ACC 8417 
Leu Arg Asn Arg Leu Asn Ala Gly Thr Gly Leu Lys Leu Pro Ala Thr 
15 2520 2525 2530 

GTC ATC TTC GAC CAC CCC AGC CCG ACC GCC CTG GCC AGA CTG CTG CTC 8465 
Val lie Phe Asp His Pro Ser Pro Thr Ala Leu Ala Arg Leu Leu Leu 
2535 2540 2545 2550 

20 

GAC CGG CTG ACC GGC GCC GGA GCC CCC GCG CCC GCC GCC GAT GAG CCG 8513 
Asp Arg Leu Thr Gly Ala Gly Ala Pro Ala Pro Ala Ala Asp Glu Pro 
2555 2560 2565 

CCA CTG CCC GTC GCC GTG GCC GAC GAC GAC CCG GTG GTC ATC GTC GGC 8561 
25 Pro Leu Pro Val Ala Val Ala Asp Asp Asp Pro Val Val He Val Gly 

2570 2575 2580 

ATG GCG TGC CGT TTC CCC GGT GGG GCC GGC ACC CCT GAG GCG CTG TCG 8609 
Met Ala Cy* Arg Phe Pro Gly Gly Ala Gly Thr Pro Glu Ala Leu Trp 
30 2585 2590 2595 

AAG CTG GTG ACC GAG GAG CGT GAC GTC ATA GGC GCC GCG CCC ACC GAC 8657 
Lys Leu Val Thr Glu Glu Arg Asp Val He Gly Ala Ala Pro Thr Asp 
2600 2605 2610 

35 

CGG GGC TGG GAC CTG GAT TCC GTC TAC GAC CCG GAG CCG GGT GTC GCG 8705 
Arg Gly Trp Asp Leu Asp Ser Val Tyr Asp Pro Glu Pro Gly Val Ala 
2615 2620 2625 2630 

GGG AAG ACA TAT GTG CGG GAG GGG GGT TTT CTC CAC GAC GCG GCG GAG 8753 
40 Gly Lys Thr Tyr Val Arg Glu Gly Gly Phe Leu His Asp Ala Ala Glu 

2635 2640 2645 

TTC GAC GCG GAG TTC TTC GGG ATT TCG CCG CGT GAG GCG GTC GCG ATG 8801 
Phe Asp Ala Glu Phe Phe Gly He Ser Pro Arg Glu Ala Val Ala Met 
45 2650 2655 2660 

GAT CCG CAG CAG CGG CTG TTG CTG GAG ACC TCC TGG GAG GCG ATC GAG 8849 
Asp Pro Gin Gin Arg Leu Leu Leu Glu Thr Ser Trp Glu Ala He Glu 
2665 2670 2675 

60 CGG GCG GGT ATC GAC CCG CAC TCG CTG CAC GGC AGC CGC ACC GGG GTA 8897 

Arg Ala Gly He Asp Pro His Ser Leu His Gly Ser Arg Thr Gly Val 
2680 2685 2690 

TAC GTC GGG CTG ACC CAT CAG GAG TAC GCC TCC CGG CTG CAC GAG GCC 8945 
55 Tyr Val Gly Leu Thr His Gin Glu Tyr Ala Ser Arg Leu His Glu Ala 

2695 2700 2705 2710 
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GTG GAT GAG CCG AGC CGG CAT GTG GAC TGG TCG GCT GGT GGT GTC CGG 
Val Asp Glu Pro Ser Arg His Val Asp Trp Ser Ala Gly Gly Val Arg 
2955 2960 2 965 

TTG CTG ACC GAG GCG CGG GAG TGG CCG GGG GTG GAC CGG CCG CGT CGG 
Leu Leu Ttir Glu Ala Arg Glu Trp Pro Gly Val Asp Arg Pro Arg Arg 
2970 2975 2980 

GCG GCG GTC TCC GCC TTC GGT GTC ACT GGT ACC AAC GCC CAT CTG ATC 

* Ma Xtl, Ser Ma Phe Gly Val Ser Gl * Ala His Leu lie 

2985 2990 2995 

CTC GAA GCC CCC GAA GCC CTC GAA GCC CTC GAA GCC ACC GAC GCC CCC 

?il Ala Pr ° ° 1U A1 * LeU G1U Ala ^ Glu Ala «»r Asp Ala Pro 
3000 3005 3010 

^ ff l CC ?? C CCC GAA ^ GAC GTC ACC GAC GTC ACC GAA 

Clu Ala Pro Glu Ala Pro Glu Ala Pro Asp Val Thr Asp Val Thr Glu 

3015 3020 302S 3030 

GCC CTC GAA GCC CCC GAC GCC ACC GAG GCG GAG GGT GCG AAG GCT CCT 
Ala Leu Glu Ala Pro Asp Ala Thr Glu Ala Glu Gly Ala Lys Ala Pro 
3 °35 3040 3045 

GGC ACT CCC GAA GAG GCA CAG CCT GCT GTS GGT GTS GTC CCG GTG GTG 
Gly Ser Pro Glu Glu Ala Gin Pro Ala Val Gly Val Val Pro Val Val 
3050 3055 3o 6 o 

vT c*^* J??" TOO CGG GTG GTG GTG CGG GAG GCT GCG GGC CGG TTG 

Val Ser Gly Arg Ser Ar 9 Val Val Val Arg Glu Ala Ala Gly LeS 
3 °65 3070 3075 

Sa ?? 2?° ?? C ^ m 01X3 000 GCC GA T GTG GCG GTG 

?o«ft 1 G1U AU Gly Gly VAl Cl * ^ U Al* Asp Val Ala Val 

3080 3085 3090 

TSi^^^f^r 5 ^ m ^ TAT ^^^^CTCGCT 
Thr Met Ala Gly Arg Ser Arg Phe Gly Tyr Arg Ala Val Val Leu Ala 

3100 3105 3no 

CGG GGT GAG GCT GAG CTT GCC GGG CGT TTG CGG GCG TTC GCC av ew 
Arg Gly Clu Ala Glu Leu Ala Cly Arg Leu Arg Ala Leu Ala Gly Gty 
3115 3120 31 2 5 

CAT CCG GAC GCG GGT GTC GTC ACC GGT GCG GTC GTC GAC CCG GAG ACG 
Asp Pro Asp Ala Gly Val Val Thr Gly Ala Val Val Asp Pro Glu Thr 
3130 3135 J140 

GGG TCC GGT GGT GGG GGC GTG GTC TTC GTT TTC CCT GGT CAG GGG ACG 
Gly Ser Gly Gly Gly Gly Val Val Leu Val Phe Pro Gly Gin SJ £ 
31 «S 3150 3155 Y 

n ^ 2"? °?° AT ° 0OT CCG GGG CTG CTG GGG TCT TCG GAG GTC TTT 

V% n V&1 Gly Met Gly Ala Gly Leu ^ Gly ser Ser Glu Val Phi 
3160 3165 3170 

GCG GCG TCG ATG CGG GAG TCT GCG CGG GCG CTC ACT GTT CAT C?rr 
Ala Al. ser Met Axg Glu Cys Ala Arg Ala Leu Ser SaT S S? SS 
31 5 3180 3185 3190 
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TOG GAT TTG CTG GAG GTG GTG TCG GGC GGG GCC GGG TTG GAG CGG GTG 10433 
Trp Asp Leu Leu Glu Val Val Ser Gly Gly Ala Gly Leu Glu Arg Val 
3195 3200 3205 

GAT GTG GTG CAG CCC GTG ACG TGG GCG CTG ATG GTG TCG CTG GCC CGG 10481 
Asp Val Val Gin Pro Val Thr Trp Ala Val Met Val Ser Leu Ala Arg 
3210 3215 3220 

TAC TGG CAG GCC ATG GGT GTG GAC GTG GCT GCG GTG GTG GGT CAT TCC 10529 
Tyr Trp Gin Ala Met Gly Val Asp Val Ala Ala Val Val Gly His Ser 
3225 3230 3235 

CAG GGG GAG ATC GCT GCT GCC ACG GTG GCG GGG GCG TTG TCG CTG GAG 10577 
Gin Gly Glu lie Ala Ala Ala Thr Val Ala Gly Ala Leu Ser Leu Glu 
3240 3245 3250 
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GAT GCG GCG GCT GTG GTC GCT CTG CGG GCG GGG TTG ATT GGC CGG TAT 10625 
Asp Ala Ala Ala Val Val Ala Leu Arg Ala Gly Leu He Gly Arg Tyr 
3255 3260 3265 3270 

CTG GCG GGT CGT GGT GCG ATG GCG GCT GTT CCG CTG CCT GCC GGC GAG 10673 
Leu Ala Gly Arg Gly Ala Met Ala Ala Val Pro Leu Pro Ala Gly Glu 
3275 3280 3285 

GTC GAG GCC GGG CTG GCG AAG TGG CCC GGA GTA CAG GTA GCC GCG GTC 10721 
Val Glu Ala Gly Leu Ala Lys Trp Pro Gly Val Gin Val Ala Ala Val 
3290 3295 3300 

AAC GGT CCG GCG TCC ACG GTG GTT TCC GGG GAT CGG CCG GCG GTG GCC 10769 
Asn Gly Pro Ala Ser Thr Val Val Ser Gly Asp Arg Arg Ala Val Ala 
3305 3310 3315 

GGT TAT GTG GCC GTC TGT CAG GCG GAG GGT GTG CAG GCT CGG* TTG ATA 10817 
Gly Tyr Val Ala Val Cys Gin Ala Glu Gly Val Gin Ala Arg Leu He 
3320 3325 3330 

CCG GTG GAC TAC GCC TCT CAC TCC CGC CAT GTG GAG GAC CTG AAG GGC 10865 
Pro Val Asp Tyr Ala Ser His Ser Arg His Val Glu Asp Leu Lys Gly 
3335 3340 3345 3350 

GAG TTG GAG CGG GTG CTG TCC GGT ATC CGC CCC CGC AGT CCG CGG GTG 10913 
Glu Leu Glu Arg Val Leu Ser Gly He Arg Pro Arg Ser Pro Arg Val 
3355 3360 3365 

CCG GTG TGT TCC ACC GTC GCC GGA GAG CAG CCG GGC GAG CCG GTT TTC 10961 
Pro Val Cys Ser Thr Val Ala Gly Glu Gin Pro Gly Glu Pro Val Phe 
3370 3375 3380 

GAT GCG GGG TAT TGG TTC CGT AAT CTG CCG AAC CGG GTT GAG TTC TCC 11009 
Asp Ala Gly Tyr Trp Phe Arg Asn Leu Arg Asn Arg Val Glu Phe Ser 
3385 3390 3395 

GCG GTG GTC GGT GGT TTG TTG GAG CAG GGC CAC CGT CGG TTC ATC GAG 11057 
Ala Val Val Gly Gly Leu Leu Glu Gin Gly His Arg Arg Phe He Glu 
3400 3405 3410 

GTC AGT GCC CAC CCG GTA CTC GTC CAT GCC ATT GAG CAG ACG GCC GAG 11105 
Val Ser Ala His Pro Val Leu Val His Ala He Glu Gin Thr Ala Glu 
3415 3420 3425 3430 
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GCC CCG GAC CGG ACT GTC CAT GCC ACC GGA ACC CTC CGC CGC CAG GAC 
Ala Ala Asp Arg Ser Val His Ala Thr Gly Thr Leu Arg Arg Cln Asp 
3*35 3440 3445 

GAC ACC CCG CAC CGC CTG CTG ACC TCC ACC GCC GAG GCC TOG GCC CAC 
Asp Ser Pro His Arg Leu Leu Thr Ser Thr Ala Glu Ala Trp Ala His 
3450 3455 3460 

QGC GCC ACC CTC ACC TGG GAC CCC GCC CTC CCC CCA OCT CAC CTC ACC 
Gly Ala Thr Leu Thr Trp Asp Pro Ala Leu Pro Pro Gly His Leu Thr 
3465 3470 3475 

ACC CTC CCC ACC TAC CCC TTC AAC CAC CAC CAC TAC TOG GCC GTC ACA 
Thr Leu Pro Thr Tyr Pro Phe Asn His His His Tyr Trp Ala Val Thr 
3480 3485 3490 

TCC CCC GCC GGA GTC QGC GAC GCG GCT CCG GGC CGG TTC GGT ATC ACC 
Ser Pro Ala Gly Val Gly Asp Ala Ala Ala Gly Arg Phe Gly Met Thr 
3495 3500 3505 3510 

TGG GAG GAC CAC CCC TTC CTC CGT GGC GGG TTA CCC CTC GCC GAC TCC 
Trp Clu Asp His Pro Phe Leu Arg Gly Gly Leu Pro Leu Ala Asp Ser 
3515 3520 3525 

GGT GAG CGG GTC TTC GCC GGG CGG CTC GCG GGC TCC GAG CAC GAC TCC 
Gly Glu Arg Val Phe Ala Gly Arg Leu Ala Gly Ser Glu His Asp Trp 
3530 3535 354 0 

CTC ACC GAC CAT GCC CTG TCC GGG GTC ACC TTC CTC CCG GGT ACC GCC 
Leu Thr Asp His Ala Val Ser Gly Val Thr Leu Leu Pro Gly Thr Ala 
3545 3550 3555 

TTC GTC GAG TTC GCG CTG CAC GCG GGA GCC GCC ACC GGC TCC GGG CGG 
3560 L * U "sis* 13 A1 * A1 * Thr Gly 0/8 Gly *** 

CTC GAA GAG CTG AGC CTT CAG GCG CCG TTC GTC TTC CCC GCC GCC OCT 
Leu Glu Glu Leu Ser Val Clu Ala Pro Leu Val Leu Pro Ala Ala Gly 
3575 3580 35 8 5 35 | 0 

GGT GTC CGG GTC CAG ATC AGG GTC TCG GCC GCC GAC GAG TCG GGA OCT 
Gly Val Arg Val Cln Met Arg Val Ser Ala Ala Asp Glu Ser Gly Arg 
3595 3600 i605 

CGG AGG GTC GCC ATC CAC TCG GCC CCG GAA GCC GCC GTC CAC TCG GCC 
Arg Arg Val Ala lie His Ser Ala Pro Glu Ala Ala Val Hi. Ser Ala 
3610 3615 3620 

CCA GAA GGC GGC GAC TCG GCC GGT GTC TGG ACG CGG CAC GGC GAG GGC 
Ala Glu Gly Gly Asp Ser Ala Gly Val Trp Thr Arg His Gly Glu Gly 
3625 3630 36 35 

ACG CTC GTC CCG GAC CCG GAG CCC ACG CCT CCG GAC GCC GAC TCC CCC 
Thr Leu Val Pro Asp Pro Glu Pro Thr Pro Pro Asp Ala Asp Trp Ala 
364 ° 364S 3650 

CGG GCC TCG CCG CCC GCC CGG GAA CGC GTC GAA CCG GCC CAG CTC TAC 
Arg Ala Trp Pro Pro Ala Gly Glu Arg Val Glu Pro Ala Glu Leu Tyr 
3655 3660 3665 3570 
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GAA CGG TTC GGG GCC CTC GGC TAC GAG TAC GGT GAG GCG TTC GCG GGC 
Glu Arg Phe Gly Ala Leu Gly Tyr Glu Tyr Gly Glu Ala Phe Ala Gly 
3675 3680 3685 

GTC CGC GCC GTA TGG CGG CAG CCG GAC GCG CTG CTC GCC GAG GTC CTC 
Val Arg Ala Val Trp Arg Gin Pro Asp Ala Leu Leu Ala Glu Val Leu 
3690 3695 3700 

CTG CCC GAC CGG GCC TCG ACC GGT GCC GGC CGG TTC GGT GTG CAC CCC 
Leu Pro Asp Arg Ala Ser Thr Gly Ala Gly Arg Phe Gly Val His Pro 
3705 3710 3715 

GCG CTG CTG GAC GCG GCG CTG CAG CCG TGG ATC GCC GGT GGT CTC CTC 
Ala Leu Leu Asp Ala Ala Leu Gin Pro Trp He Ala Gly Gly Leu Leu 
3720 3725 3730 

GAA GTG CCG GAG GAC GCA GTG CTG CTG CCC TTC GCC TGG CAG GGA GTG 
Glu Val Pro Glu Asp Ala Val Leu Leu Pro Phe Ala Trp Gin Gly Val 
3735 3740 3745 3750 

TCG CTC TAC GCG ACG GGT GCC GGT GCT CTG CGG GTG CGG CTC ACG AAG 
Ser Leu Tyr Ala Thr Gly Ala Gly Ala Leu Arg Val Arg Leu Thr Lys 
3755 3760 3765 

GCG GGT GAC GGG GCG GTC TCG CTC CAG GCC GCA GAC ACG TCC GGC GCG 
Ala Gly Asp Gly Ala Val Ser Leu Gin Ala Ala Asp Thr Ser Gly Ala 
3770 3775 3780 

GCC GTG CTC TCC TTG GQG GCC CTG GTG ATG CGT CCG CTG GCG CGC CGG 
Ala Val Leu Ser Leu Gly Ala Leu Val Met Arg Pro Leu Ala Arg Arg 
3785 3790 3795 

AAG CTG GAC GTG CTG CTC GGC ACG GAC GCC GGC GAA CGG TCG CTG TAC 
Lys Leu Asp Val Leu Leu Gly Thr Asp Ala Gly Glu Arg Ser Leu Tvr 
3800 3805 3810 

CGC GTC GAG TGG CAG CCG CGG CTC CTG CCC GCC GGC CCG CCG CGC TCC 
Arg Val Glu Trp Gin Pro Arg Leu Leu Pro Ala Gly Pro Pro Arg Ser 
3815 3820 3825 * 3830 

TGG GCG GTG CTC GGC CCC GAC GCG GAC CGG CTC GCC GGG ACG CCG GGC 
Trp Ala Val Leu Gly Pro Asp Ala Asp Arg Leu Ala Gly Thr Pro Gly 
3835 3840 3845 

CTG GGG GAT CAG CCG GAC GGT GGG CCC ACC GCG CTC TAC CCG GAG GTC 
Leu Gly Asp Gin Pro Asp Gly Gly Pro Thr Ala Leu Tyr Pro Glu Val 
3850 3855 38 60 

CGG GCG CTG CGG AAG GCG CTG GCG GCG GGC GCG CCG CGG CCG GAA GCG 
Arg Ala Leu Arg Lys Ala Leu Ala Ala Gly Ala Pro Arg Pro Glu Ala 
3865 3870 3875 

GTC GTA CTC CCG GTG CTC TCC GGG GCC GGG GCC ACT CCG GAG TCG GTC 
Val Val Leu Pro Val Leu Ser Gly Ala Gly Ala Thr Pro Glu Ser Val 
3880 3885 3890 

CGG CAG ACA ACG GAG CGC TGT CTG ACC GCG CTC CAG GAC TCG CTC GAC 
Arg Gin Thr Thr Glu Arg Cys Leu Thr Ala Leu Gin Asp Trp Leu Asd 
3895 3900 3905 3 910 



11873 



11921 



11969 



12017 



12065 



12113 



12161 



12209 



12257 



12305 



12353 



12401 



12449 



12497 



12545 



24 



EP0 791 655 A2 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



!£ ^ S° i™ S*? GAC ACA CCG CTC ATA GTC CTC ACC AGG GGA GCC 
Ala Glu Glu Leu Val Asp Thr Pro Leu He Val Leu Thr Arg Gly Ma 
391S 3920 3 / 2S 

Val JU* JU^ i/^ o5^* 0(30 ^ C * I>C * r 00(3 

Val Ala Ala Val Pro Gly Glu Glu He Gly Asp Leu Ala Cya Ala Gly 

3930 3 *3S 39 40 Y 

Val Trp Gly Leu Val Arg Ser Ala Arg Ser Glu His Pro Gly Ar^ S 

. 3950 3955 

GCC CTC GTC GAC ACC GAC GGG CAT CCG GAC GAC CCC ACC GCG CTG CCC 
Ala Leu V.1 Asp Thr Asp Gly His Pro Asp Asp Arg Thr Ala Leu Pro 
3960 3965 3970 

CTC^CTCCGCGCGGTCCTCGACC^C^CGGCCAGCTCTCCCTaCGG 
Leu A la Leu Arg Ala Val Leu Asp Gly Ala Gly Gin Leu Ser Leu Arg 

* 3980 3985 3990 

?5 C 5?° «* CGG ACC CCG GTC CTC CTC CGG GCC GGG ACC CCG GAG 

Ala Gly Thr Ala Arg Thr Pro Val Leu Leu Arg Ala Gly Thr Pro S£ 
3995 4000 4 005 

GAG CAG CGG GGT CCG GCA TTC GAC CCG GCG GGC ACG CTC CTG GTC ACQ 
Glu Gin Arg Gly Pro Ala Phe Asp Pro Ala Gly Thr 25 2 S? JS 
4010 4015 4020 

^ ff* ^ 000 ACG ^ 6(3(3 000 CTG GCC CGG CAT CTG CCC GCC 

Gly Ala Thr Gly Thr Leu Gly Arg Leu Leu Ala Arg Ms Sa Sa 
,uz:> 4030 403S 

GAG CAC GGT GTG CGC CAT CTG CTG CTG CTG AGC CGC GGC fifir mr 
Glu Ji^Gly Val Arg His LjuLeu Leu Leu oTy Art SI 

JUolu Gly Ala Asp Glu Leu Ala Ala Glu Leu Ala oTy 22 SI Sa 

u ^065 407Q 

4W/5 4080 40a5 

CGG GTG CTG GCG GAG GTG COG GCC GAC CGG COG CTG ACC CCA r*r »av 

113 v " Uu ^ 0 clu v " - a*. S 2 S £ S S 

,090 4095 4100 

S S ° If? CTC GAC GAC GGC ACA CTC GAC GCG CTG ACC CCG 

Hxs Ala A a Gly Val Leu Asp Asp Gly Thr Leu Asp Ala 2 £ S 
* 1W3 4110 4U5 

GAA CGG ATC GGT ACC GTC ATG CGG CCG AAG GCG GAC GCG GCG CTC AAC 
Glu Arg lie Gly Thr Val Met Arg Pro Lys Ala Asp Ala Ala Leu 

4125 4130 

CTG CAC GAA CTG ACC CGG ACC AGC CCG CTG TCG GTC TTC GCC r*m 

«„ uu «, ^ s . r Pro uu Z 2 S S 
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TCG GGC GCT GCC GGC ATC CTG GGC CGC CCC GGA CAG GCC AAC TAC GCC 
Ser Gly Ala Ala Gly He Leu Gly Arg Pro Gly Gin Ala Asn Tyr Ala 
*155 4160 4165 

GCC GCC AAC ACC TTC CTC GAC GCG CTC GCG CAG CAC CGC CGC GCC CAC 
Ala Ala Asn Thr Phe Leu Asp Ala Leu Ala Gin His Arg Arg Ala His 
41™ 4175 4180 

GGC CTC CCC GCC CTG TCG CTG GCC TCG GGG CTG TOG GGC GGC GCG ACC 
Gly Leu Pro Ala Val Ser Leu Ala Trp Gly Leu Trp Gly Gly Ala Thr 
4185 4190 4195 

GGC ATG ACC GGC CAT CTG TCC GGC ACC GAT CTC CGC CGG ATC CGC AGG 
Gly Met Thr Gly His Leu Ser Gly Thr Asp Leu Arg Arg Met Arg Arg 
4200 4205 4210 

TCC GGT ATC GCG CCG ATC ACC CAC GAC CAG GGG CTC GCC CTC TTC GAC 
Ser Gly He Ala Pro Met Thr His Asp Gin Gly Leu Ala Leu Phe Asp 
4215 4220 4225 4230 

CGA GCG CTC GCC GCC TCG GCC GAG GAC CCG CTC CTC GTA CCG ATC CGG 
Arg Ala Leu Ala Ala Ser Ala Glu Asp Pro Leu Leu Val Pro Met Arg 
4235 4240 4245 

CTC GAC CTC GCC GCC CTC GTC CGG GAG CGG GCC GAG CAC GGG CCG GAC 
Leu Asp Leu Ala Ala Leu Val Arg Glu Arg Ala Glu His Gly Pro Asp 
4250 4255 4260 

^t** < ~^~ C< ^ G CCC CGG GCC GCG GTC 

Ala Val Pro Gly Pro Leu Leu Gly Leu Leu Pro Ala Arg Ala Ala Val 
4265 4270 4275 

CGG CAG GCG GCG GCA CCG GTA CGC GGC GGA GCC CCC GCC CCC GCC GGC 

™ A1 * AU M * Pr ° Val Gly G1 * Ala Pro A1 » Pro Ala Gly 

4280 4285 4290 

GGC GAG GGG ACG GCC GAG CGG CTC GCC GGG CTC GGG GAG GAG CCC AGG 
Gly Glu Gly Thr Ala Glu Arg Leu Ala Gly Leu Gly Glu Glu Ala Arg 
4295 4300 4305 43 10 

CTC CGC GAG CTC GTC AGG CTC GTC CGC GCC GAG GTC TOG GGC GTC CTC 
Leu Arg Glu Leu Val Arg Leu Val Arg Ala Glu Val Ser Gly Val Leu 
4315 4320 4325 

GGC TAC TCG CGC CCG GAC GCG GTC GAG CCC GGG CGC CCG TTC AAG GAT 
Gly Tyr Ser Gly Pro Asp Ala Val Glu Pro Gly Arg Pro Phe Lys Asp 
4330 4335 4340 

CTC GGC TTC GAC TCG CTC ACC GCC GTC GAG CTC CGC AAC CGC CTC GGC 
Leu Gly Phe Asp Ser Leu Thr Ala Val Glu Leu Arg Asn Arg Leu Gly 
4345 4350 4355 * 1 

GCC GCC ACC GGG CTG CGG CTC CCG ACC GCG CTC GTC TTC GAC CGC CCG 
Ala Ala Thr Gly Leu Arg Leu Pro Thr Ala Leu Val Phe Asp Arg Pro 
4360 4365 437Q 

ACG TCC CAG GCA GTC GCC GAG TAC CTC GCT GCC GAA CTC GCC GGA CCG 
Thr Ser Gin Ala Val Ala Glu Tyr Leu Ala Ala Glu Leu Ala Gly Pro 
4375 4380 4385 4390 
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COG GAC GGC GGC GAC ACC GCG GCC GCC GCG TTC GAG GGC CTG GAG GCG 
Arg Asp Gly Gly Asp Thr Ala Ala Ala Ala Phe Glu Gly Leu Glu Ala 
439S 4400 4405 

CTG GCC GCG GOG GTG GGC GCG CTG GCC GAG GAC CAT CTG CGG CGC GAC 
Leu Ala Ala Ala Val Cly Ala Leu Ala Glu Asp Asp Leu Arg Arg Asp 
4410 4415 4420 

GTG CTC CGG CGG CGA CTG ACC GAA CTG GCC GCC GCC CTC ACC CCG CAG 
Val Leu Arg Arg Arg Leu Thr Glu Leu Ala Ala Ala Leu Thr Pro Gin 
4 *25 4430 4435 

GGC CGG AAC CCC TCC GCG CCC GCA CCC GCC CCG TCC GAT CTG GAC GAG 
Gly Arg Asn Pro Ser Ala Pro Ala Pro Ala Pro Ser Asp Leu Asp Glu 
4440 4445 4450 * 

CCG CTG GAC TCC GCG AAC GAC GAC GAC CTC TTC GCC TTC ATC GAG GAG 
Arg Leu Asp Ser Ala Asn Asp Asp Asp Leu Phe Ala Phe He Glu Glu 
4455 4460 4465 4470 

CAG CTT TGA CCAGCGAGAA CGACAGCAGC GAGAACGACG GCGACGACAC 
Gin Leu * 

CCCCGGGCCA GCTCCGGGGA CCCCTCCGGG GGCCTCCCCC CGGCAGGACG ACCGGGTCAG 

GGAGTATCTG AAGCGG GTG ACC GCC GAA CTC GTC GCC ACC CGC AAC CGG 
Met Thr Ala Glu Leu Val Ala Thr Arg Lys Arg 
1 5 10 

CTC GCG CTC GAG GAG CGG GCC CGC GAA CCG ATC GCC GTC GTC Ga 

Leu Gly Ala Leu Glu Glu Arg Ala Arg Glu Pro lie Ala Val Val Sa 
15 20 25 

ATC AGC TCC CGC TAC CCG GGC GGG GTC ACC ACC CCC GAG GAC CTC TCC 
Met Ser Cys Arg Tyr Pro Gly Gly Val Thr Thr Pro Glu Asp Leu Trp 

35 40 

CGG CTT CTC GCG GAC GAA CGC GAC GCC GTA TCC GGA CTT CCC CGG GAC 
Arg Leu Leu Ala Asp Glu Arg Asp Ala Val Ser Gly Leu Pro Arg Asp 

50 55 

^ ^° ^ 0X8 ^ 000 010 TAC GAC CCC GAC GGC GGC CCC GGC 

Arg Gly Trp Asp Leu Asp Ala Leu Tyr Asp Pro Asp Cly Gly Pro Gly 

65 70 75 

»ph^ GGC l *"^ C AGC CAC TCC GCC GGA TTC 

Thr Ser Tyr Ala Arg Glu Gly Gly Phe Leu Ser His Cys Ala Gly p£ 
80 85 90 

GAC GCG GAG TTC TTC GGC ATC TCC CCG CGC GAG GCG CTG GCG ATC GAC 
Asp Ala Glu Phe Phe Gly lie Ser Pro Arg Glu Ala Leu aS SZ 2p 
95 100 10 5 

11° 2° CC^ CIG CTG CTG GAG ACC TCC TGC CAG GCC CTG GAA CGC 
Pro Gin Gin Arg Leu Leu Leu Glu Thr Ser Trp Glu Ala Leu Glu 
110 115 120 

S £° ^° GAC CGC GCC CGG GGC AGC CGG ACG GGC GTG TAC 

Ala Gly Val Thr Ala Asp Arg Ala Arg Gly Ser Arg Thr Gly Val Tyr 
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125 130 135 

GCG GGC GTC ATC TAC GAC GAC TAC GGC GCC COG GTC CTC TAC GGC GCC 14815 
Ala Gly Val Met Tyr Asp Asp Tyr Gly Ala Arg Val Leu Tyr Gly Ala 
140 145 150 155 

GGC GCC GGC CCG CCC GAG GAC CTG GAG GGT TAT CTC GTC AAC GGC AGC 14863 
Gly Ala Gly Pro Pro Glu Asp Leu Glu Gly Tyr Leu Val Asn Gly Ser 
160 165 170 

GCG GGC AGC ATC GCC TCC GGC CGT GTC TCC TAC ACG TTC GGG CTC CGC 14911 
Ala Gly Ser He Ala Ser Gly Arg Val Ser Tyr Thr Phe Gly Leu Arg 
175 180 185 

GGC CCC GCG GTC ACC GTC AAT ACG GCC TGT TCG TCG TCA CTG GTG TCG 14959 
15 Gly Pro Ala Val Thr Val Asn Thr Ala Cys Ser Ser Ser Leu Val Ser 

190 195 200 

CTC CAT CTG GCG GTG CGT GCC CTC CGG AAC GGC GAG TCC GAC ATC GCA 15007 
Leu His Leu Ala Val Arg Ala Leu Arg Asn Gly Glu Cys Asp Met Ala 
20 205 210 215 

CTC GCC GGC GGG GCG ACG GTC CTC TCC ACC CCC ACC GTC CTC GTC GAC 15055 
Leu Ala Gly Gly Ala Thr Val Leu Ser Thr Pro Thr Val Leu Val Asd 
220 225 230 2 35 

25 TTC TCC CGC CAG CGC GGT CTC GCC CCC GAC GGC CGC TCC AAG GCG TTC 15103 

Phe Ser Arg Gin Arg Gly Leu Ala Pro Asp Gly Arg Cys Lys Ala Phe 
240 245 250 

GCC GAC TCC GCC GAC GGC ACC TCC TCG GCC GAG GGC GCC GGA ATC CTC 15151 
30 Ala Asp Ser Ala Asp Gly Thr Ser Trp Ala Glu Gly Ala Gly Met Leu 

255 260 265 

CTC CTC CAG CGG CTC TCC GAC GCC CGC CGC GAG GGG CGC CCC GTC CTC 15199 
Leu Leu Gin Arg Leu Ser Asp Ala Arg Arg Glu Gly Arg Pro Val Leu 
35 270 275 280 

GCC GTC ATT CGC GGC TCG GCC GTC AAC CAG GAC GGC GCC AGC AAC GGA 15247 
Ala Val He Arg Gly Ser Ala Val Asn Gin Asp Gly Ala Ser Asn Glv 
285 290 



295 



15295 



40 CTC ACC GCC CCC AAC GGG COG GCG CAG CGG CAG GTC ATC GAG GAC GCG 

Leu Thr Ala Pro Asn Gly Arg Ala Gin Arg Gin Val He Glu Asp Ala 
300 305 310 315 

CTC CGC GAC GCC GGG GTC GGC CCC GAC CAG GTC GAC GCG GTC GAG GCG 15343 
^ Leu Arg Asp Ala Gly Val Gly Pro Asp Gin Val Asp Ala Val Glu Ala 

3 20 325 330 

CAT CGC ACC GGT ACC GAG CTC GGC GAC CCC ATC GAG GCC GGG GCG CTC 15391 
His Gly Thr Gly Thr Glu Leu Cly Asp Pro He Glu Ala Gly Ala Leu 
335 340 345 

SO 

CTC GCC ACC TAT GGA ACG GCC CGT ACG GCG GAG CGC CCG CTC TCG CTC 15439 
Leu Ala Thr Tyr Gly Thr Ala Arg Thr Ala Glu Arg Pro Leu Trp Leu 
350 355 360 

55 GGC TCC CTC AAG TCC AAC ATC GGG CAC ACC CAG GCC GCC GCC GGT GOT 15487 

Gly Ser Leu Lys Ser Asn He Gly His Thr Gin Ala Ala Ala Gly Val 
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365 37 0 



375 



^ ^ ^ **° ATG 010 0X0 003 ATC 000 0710 000 CCC CTC CCC 15535 
Ala Cly Val lie Ly. Met Val Leu Ala Met Arg His Gly Arg Leu Pro 

380 385 390 395 

%° 7™ GTC GAC CCG CCC ACC ACC CGG GTG GAC TOO GAQ AAG 15583 

Arg Thr Leu His Val Asp Arg Pro Thr Thr Arg Val Asp Trp Glu Lys 
400 405 410 

G^GGGCTG^CTCCTCACC<»GCCGGTGCCATC»CCGGGG<^G<» 15631 
Gly Gly Val Arg Leu Leu Thr Glu Pro Val Pro Trp Pro Gly Glu Ala 
415 420 42S 

CGG GAG CCG CGT CGC GCG GGC GTG TCC TCC TTC GGC GCG AGC OGC ACG IROQ 
Gly Glu Pro Arg Arg Ala Cly Val Ser Ser Phe Gly S Sr ^ l5<?9 
430 435 440 

AAC CCC CAT CTC GTG CTC GAG AGC GTC CCG GCC CGT GAA CCG CCC GCC 15727 
Asn Ala His Val Val Leu Glu Ser Val Pro Ala Cly Glu Pro S Sa 
20 445 450 455 



w 



15 



25 



35 



40 



GCC GCG CGG CCG GAG GAC ACA GGC GGC GCC TCC ACG GTC AGC GGC CGC 
Ala Gly Arg Pro Glu Asp Thr Gly Gly Ala Trp Thr Val Ser GW Arg 
* b0 465 470 475 

2° ^ ^ 1°° ° CAG CGG ^ TAC GAC GCC CTC 

Gly Pro Ala Ala Leu Arg Ala Gin Ala Ala Arg Leu Tyr Asp Ala Leu 

480 485 490 



CCC CGC ACC GCC GAG GTG GCC CGC GCG CTG GCC CAC CCC CGT ACC GCC 
Pro Gly Thr Ala Glu Val Ala Gly Ala Leu Ala His Ala Arg Thr Ala 
Si0 515 520 

TTC CGG CAC CGG CCC GTC GTG CTC GGC GGA AAC CGC GCC GAA CTG CTC 
Phe Arg His Arg Ala Val Val Leu Gly Gly Asn Arg Ala Glu Leu 2u 
^ D 530 535 

CCG OGC CTG CGC GAG CTG GCG GAG GAG GAG CAT CCC GGA CCC CGC GTC 
Ala Gly Leu Arg Glu Leu Ala Glu Glu Glu His Pro Gly Pro Arg Val 
a * W 545 550 555 

ST £ A ^° US 0 000 CCG ^ A<X GAG CGG C«3 ACG GCC TTC CTC TTC 
Val Thr Gly Thr Ala Pro Ala Thr Glu Arg Arg Thr Ala Phe Leu p£ 

560 56S 570 

^ ^ ^ ^ AGC CAG CGG GCC GGC TCC GGC CGG GGC CTG TAC CGC 
Ser Cly Gin Gly Ser Gin Arg Ala Gly Ser Cly Arg Cly Leu Ty"r 2g 
50 575 580 585 

CGC CAC CCG GTC TTC GCC CGC GCC CTC GAC GAG GTC TCC GCC GCG CTC 
Arg His Pro Val Phe Ala Arg Ala Leu Asp Glu Val Cys Ala £S 
M0 59S 600 



55 ^ f AT m ^ CGC CCC 0X5 CGT GAC CTG ATG TTC GCC GAG CCC 

Glu Pro His Leu His Arg Pro Leu Arg Asp Leu Met Phe Ala Glu Pro 



15775 



15823 



^ C 5°° ff C 000 ACC GGC ACC GGA CAG GGC GCC GGA CAG GGC GCC GGA 15871 
Thr Cly Thr Gly Thr Cly Thr Gly Gin Gly Ala Gly Gin C^ S ™ 
495 500 505 



15919 
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16159 



16207 
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605 610 615 

GGC AGC COG GAA GCG GAG CCG CTC GAC CGC ACC GAG TTC ACC CAG CCC 16255 
Ciy Ser Pro Glu Ala Glu Pro Leu Asp Arg Thr Glu Phe Thr Gin Pro 
620 625 630 635 



CCG CTC TTC GCG CTC CAG ACC GCC CTG TTC CGG CTG GCC GAG CAC CAC 16303 
Ala Leu Phe Ala Leu Gin Thr Ale Leu Phe Arg Leu Ale Glu His His 
640 645 650 

CGG CTG CGC GCC GAG GCG CTG TQC GOG CAC AGC GTC GGC GAG ATC GCG 16351 
Gly Leu Arg Ale Glu Ale Leu Cys Gly His Ser Va l G i y G £ J£ 5° 
655 660 665 

IS f?° ^f C 5^ G ? C GOT GTG CTG ACC CTG CCC GAC GCG GCC CGT CTC 16399 

» Ale Ale His Ala Ala Gly Vel Leu Thr Leu Pro Asp Ale Ale Arg Leu 

670 675 680 

CTG GCC GCC CGG GGA CGG CTG ATG CAG GCC CTG CCG GCC GGC GOT GCC 
Val Ale Ale Arg Gly Arg Leu Met Gin Ala Leu Pro Ala Gly Gly Ala 
20 685 690 695 

SI? S5f ff 8 ?* £ C ACC 000 GAG GAG ATC CCA CCA CTG CTG GAG 

Met Ala Ala Leu Arg Ala Thr Ala Glu Glu He Ala Pro Leu Leu Glu 

'05 710 



CGC CGC GCG GGC GAA CTG GCG CTC GCC GCC GTC AAC GGC CCC TCC TCC. 
Arg Arg Ala Gly Glu Leu Ala Leu Ala Ala Val Asn Gly Pro £r <Er 
720 725 73 0 



7 35 7 40 



745 



CTC ACC CGT AGO CCC GTC ACC GAG GAA ACC CTG TGC ACC GCG GAC CAC 
Leu Thr Gly Thr Pro Val Thr Glu Glu Thr Leu Cys Thr Ala Asp His 
800 80S 81 0 

TGC GTC CGC CAG GCG CGC GAG CCG GTG CGC TTC CTG GAC GCC ATG CGG 
Trp Val Arg Gin Ala Arg Glu Pro Val Arg Phe Leu Sp £a 22 S 
50 815 820 825 

ACC CTG CGC GCC GAC CGG ATC GAC ACC TTC GTG GAA CTC GGC CCC GAC 
Thr Leu Arg Ala Asp Gly He Asp Thr Phe Val Glu Leu Gly Pro So 
830 835 



840 



GGC GTG CTG TCC GCC ATG GCC CGC GAC TGC GCG GAC GAC CGG CCC GAT 
Gly Val Leu Ser Ala Met Ala Arg Asp Cys Ala Asp Asp Arg Pro £p 



16447 



16495 



16543 



S? ST 2? 1°° 2f GAC GAG 000 000 0X0 01X2 CT A CTC GAG CAG 16591 
30 Val Val Val Ser Gly Asp Glu Ala Ala Val Leu Glu Leu Leu Glu Gin 



TCG CGG GCC GAG GGC CGC GAG GCC AGG CGG CTC GCC GTC AGC CAT GCC 
Trp Arg A a Glu Gly Arg Glu Ala Arg Arg Leu Ala SJ Set Ms 5a 

TTC CAT TCA CCG COG ATG GAC GGA ATG TTC ACA CAG TTC GAC CGG GTC 
Phe His Ser Pro Arg Met Asp Gly Met Leu Thr Gin Phe Asp Arg 

CTC ACG TTC GCT CCG CCG ACC ATT CCC CTC GTC TCC ACC 16735 
Ala Arg Tfcr Leu Thr Phe Ala Pro Pro Thr He Pro Leu Val Ser 
780 785 790 795 



16639 



16687 



16783 



16831 



16879 
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845 850 855 

GGC GAC ACA ACC GGG GCC GGG GAC GGG GAG ACC CCC GAT CCG CTC CTC 
Gly Asp Thr Thr Gly Ala Gly Asp Gly Glu Thr Pro Asp Pro Lou Leu 
*6° 865 870 8 75 

ACC CTC CCG CTG CTG CGC CGC TCC GTG CCC GAG ACC GGC GAC GCC GAA 
Thr Leu Pro Leu Leu Arg Arg Ser Val Pro Glu Thr Gly Asp Ala Glu 
880 885 890 

CAC CCC GGC GGC TTC GAA CGG GCC CTC GCC ACC GCC TAC GCA CAC GGC 
His Pro Gly Gly Phe Glu Arg Ala Leu Ala Thr Ala Tyr Ala His Gly 
895 900 905 

GTC CCC CTC CGG CTC GCG CCC GCC CCC GAC GCC GCG TCC CTC GCC GTC 
Val Pro Leu Arg Leu Ala Pro Ala Pro Asp Ala Ala Ser Leu Ala Val 
910 915 920 

GCC GCC GAA CTG CCC ACC TAC GCC TTC CAG CCC ACC CAC TAC TGG CTC 
Ala Ala Glu Leu Pro Thr Tyr Ala Phe Gin Arg Thr His Tyr Trp Leu 
925 930 935 

GAC GCG CCC GCC GCC CCC GCC GCC CTC CCC GCC GGG CTC GAC GAC GCC 
Asp Ala Pro Ala Ala Pro Ala Ala Leu Pro Ala Gly Leu Asp Asp Ala 
940 945 950 " 955 

GGT CAC CCG CTG CTC T^CC GCG GCC CTC GAC CTC CCC GGC GGA CGC GGA 
Gly His Pro Leu Leu Ser Ala Ala Leu Asp Leu Pro Gly Gly Arg Gly 
960 965 970 

ACG GTC TGG ACC GGA GCG CTC TCC GCG GCC ACC CTC CCC TCG GCC GCG 
Thr Val Trp Thr Gly Ala Leu Ser Ala Ala Thr Leu Pro Trp Ala Ala 
975 980 935 

GAC CAC AGC GTC CAC GGC CGC ACC GTC CTC CCG GGC ACC GCC CTC CTC 
Asp His Ser Val His Gly Arg Thr Val Leu Pro Gly Thr Ala Leu Leu 
"0 995 1000 

GAC CTC GCG CTC CAC GCC GCC CCG CGC GTC GGC GAG TTC ACC TTC GAG 
Asp Leu Ala Leu His Ala Ala Pro Arg Val Gly Glu Leu Thr Phe Glu 
1005 1010 1015 

GCG CCG CTC GTC CTC CCG GAG GAC GGA GAG GTC CGG CTC CGC GTC GTA 
Ala Pro Leu Val Leu Pro Glu Asp Gly Glu Val Arg Leu Ara Val Val 
1020 1025 1030 1035 

CTC GCT GAA CCG GAC GCG AGC GGA GTA CGC GAA CTC TCT GTC CAC TCC 
Leu Ala Glu Pro Asp Ala Ser Gly Val Arg Glu Leu Ser Val His Ser 
!°40 1045 1050 

GCC GGC GAG GAC GGC GGC TGG ACC CGG CAC GCG ACA GCG GTC CTC GAC 
Ala Gly Glu Asp Gly Gly Trp Thr Arg His Ala Thr Ala Val Leu Asp 
1055 1060 1065 

ACC GGC ACC ACC ACC GCC GGG GAG CCC GCC GGC GCA CCG CCC GCC GCA 
Thr Gly Thr Thr Thr Ala Gly Glu Pro Ala Gly Ala Pro Pro Ala Ala 
1070 1075 1080 

TCG CCG CCC GGG GAC GCC GAA CCC CTC GAC CTT GCC GCC GAG TAC GAG 
Trp Pro Pro Gly Asp Ala Glu Pro Leu Asp Leu Ala Ala Glu Tyr Glu 
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1085 



1090 



1095 



70 



75 



20 
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30 
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40 



45 



50 



55 



CGC TTC GCC GAC GCC GGC ATC GGA TAC GGC CCC GCC TTC CGC GGA CTG 
Arg Phe Ala Asp Ala Gly lie Gly Tyr Gly Pro Ala Phe Arg Gly Leu 
1100 U05 mo ins 

CCC TCC GCC TCG CGC GAC GGC GAC GCG ATA CTG GCC GAC GTA CGG CTC 
Arg Ser Ala Trp Arg Asp Gly Asp Ala lie Leu Ala Asp Val Arg Leu 
1120 H25 H30 

CCC GGC GAA CTG GCC GGC GAA GCC GAC CGG TAC GGC ATC CAC CCG GCC 
Pro Gly Glu Leu Ala Gly Glu Ala Asp Arg Tyr Gly lie His Pro Ala 
1135 H40 H45 

CTC CTC GAC GCC GCC CTC CAC ACC GCG GCC GCC GCC CTC GGA GGG GCG 
Leu Leu Asp Ala Ala Leu His Thr Ala Ala Ala Ala Leu Gly Gly Ala 
1150 U55 U60 

CAC GGG ATC CTC CCG TTC ACG TCG AAC GGC GTC ACC CTC CAC GCC CGC 
His Gly Met Leu Pro Phe Thr Trp Asn Gly Val Thr Leu His Ala Aro 
1165 H70 H75 

GGA GCG CAC GCC ATC CGG GTC CGG CTC ACC CCG GCC GGC CCC GAC GCG 
Gly Ala His Ala He Arg Val Arg Leu Thr Pro Ala Gly Pro Asp Ala 
1180 1185 iiso * U95 

GTC GCG GTC ACC GCC GTC GAC CCG GCG GGG CGC CCC GTC TTC ACG GCC 
Val Ala Val Thr Ala Val Asp Pro Ala Gly Arg Pro Val Phe Thr Ala 
1200 1205 1210 

GCC TCG CTC ACC CTC CGA CCG GTC ACG ACC GGG CAG CTC ACC GCG GCC 
Ala Ser Leu Thr Leu Arg Pro Val Thr Thr Gly Gin Leu Thr Ala Ala 
1215 1220 1225 

GAG GCC GCG CGG GCC CCG CTC TAC CGG GTC CGC TCG ACC GGC CTC CCG 
Glu Ala Ala Arg Ala Pro Leu Tyr Arg Val Arg Trp Thr Gly Leu Pro 
1230 1235 1240 

GAC ACC GGA ACC GCC CGG GAC CAC ACC TOG GCG GTC GCC GGC GGC CCG 
Asp Thr Gly Thr Ala Arg Asp His Thr Trp Ala Val Ala Gly Gly Pro 
1245 1250 1255 

GGC GAC CTC TTA CCC GGG GAG ACC CCG CAC CAC CCC GAC CTC GCC TCC 
Gly Asp Leu Leu Pro Gly Glu Thr Pro His His Pro Asp Leu Ala Ser 
1260 1265 1270 1275 

GCG CTC GCC GAC ACC GGC ACC GCC CCC TTC CGC GTA CTC GCG GAT CTC 
Ala Leu Ala Asp Thr Gly Thr Ala Pro Phe Arg Val Leu Ala Asp Leu 
1280 1285 1290 

CGC GGA TAC GGC ACC GCC ACC CCC CGG GAA CTC GCC TCC CAG GCG CTC 
Arg Gly Tyr Gly Thr Ala Thr Pro Arg Glu Leu Ala Ser Gin Ala Leu 
1295 1300 1305 

GCC CTC GTC CAG CAG TCG GCC GAC GCG GCC GAG GCC GCC GAA GGC AGG 
Ala Leu Val Gin Gin Trp Ala Asp Ala Ala Glu Ala Ala Glu Gly Aro 
13X0 1315 1320 

CTC GTC CTC GTC ACA CGC CGG GCC GTC GAC ATC GGT GAC GGC GTC ACG 
Leu Val Leu Val Thr Arg Arg Ala Val Asp He Gly Asp Gly Val Thr 



17695 



17743 



17791 



17839 



17887 
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18031 
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1325 1330 



1335 



GAC CCG GCG GOG GCG ACC GTG TOG GGA CTC GTC CGG GCG GCA CAG TCC 

^L Pr ° X1 * A1 * A1 * Thr Val Gl * Uu Val ^9 *1* Ala Gin Ser 
1340 134S 1350 lass 

GAG CAC CCC GGC TGC TTC GCG CTC CTC GAC ACC GAC GAC TCC CCC CGC 
Glu His Pro Gly Cys Phe Ala Leu Leu Asp Thr Asp Asp Ser Pro Arg 
"60 1365 1370 

TCC CGG CAA CTC CTG CCA CGC GTC GCG GGC ACC GCC GAG CAG CTC GCA 
Ser Arg Gin Leu Leu Pro Arg Val Ala Gly Thr Ala Glu Gin Leu Ala 
" 7 5 1380 1385 

CTC CGC GAC GGC ACC CTG CTC GCC CCC TCC CTC ACC COT GCC ACG CTG 
Leu Arg Asp Gly Thr Leu Leu Ala Pro Ser Leu Thr Arg Ala Thr Leu 
139 » 1395 1400 

CCC GCC GGC GCC CGG CTG CCC GCA CTC GAC GGC ACC GTC CTG ATC ACT 

VT— y Ala Ar9 Leu Pro Ala Thr Val Leu lie Thr 

1405 1410 1415 

CGG GGC ACC GGC AGC CTC GGC GCG GAG GCG GCC CGC CAT CTG GTC ACC 
Gly Gly Thr Gly Ser Leu Gly Ala Glu Ala Ala Arg His Leu Val Thr 
1420 1^25 1430 1435 

CGG CAC GOT GCC CGG OX: CTG CTC CTG ACC AGC CGA A«: GGC CCG CAG 
Arg His Gly Ala Arg Arg Leu Leu Leu Thr Ser Arg Ser Gly Pro Gin 
I* 40 1445 " 1450 

GCC CCC GGC GCG GCC GAA CTC GTC GCC GAA CTG GCC GCC TTG GGC GCC 
Ala Pro Gly Ala Ala Glu Leu Val Ala Glu Leu Ala Ala Leu Gly Ala 
1455 146O 1465 

CAC GCG GAC GTC GCC GCC TGC GAC GTC GCC GAC CGC GCC GCC CTG CGG 
His Ala Asp Val Ala Ala Cys Asp Val Ala Asp Arg Ala Ala Leu Arg 
1470 1475 1480 

GCC CTG CTC GAC CGC GTA CCC GCC GGC CAC CCG CTG ACC GCG GTC CTC 
Ala Leu Leu Asp Arg Val Pro Ala Gly His Pro Leu Thr Ala Val Leu 
•** 8: > 1490 1495 

CAC ACQ GCG GGC GTC CTC GAC GAC GGC GTC CTC ACC GCC CAC ACA CCG 
His Thr Ala Gly Val Leu Asp Asp Gly Val Leu Thr Ala Sn Thr S 
1500 1505 1510 1515 

CAG CGG CTC CCG GCC GTC CTC CCC CCC AAG GCC GAC GCC GTA CGC AAT 
Gin Arg Leu Ala Ala Val Leu Arg Pro Lys Ala Asp Ala vll S £n 
1S2 ° 1525 1530 

CTC CAC GAA CTC ACC CAC GGG CAC GCC CTG TCG GCG TTC ATC CTC TAC 
Leu His Glu Leu Thr Gin Gly His Ala Leu Ser Ala Phe lie Zeu Tyr 
1 53 5 1540 1545 

TCG TCG GCC GCC GGA GTG CTC GGC AGC GCG GGC CAG AGC GGC TAC GCC 
Ser Ser Ala Ala Gly Val Leu Gly Ser Ala Gly Gin Ser Gly Tyr Ala 
1550 1555 isgo 

GCC GCC AAC GCC TAC CTC CAC TCC TTC GCC GTC TOG CGG CGG AGC CGG 
Ala Ala Asn Ala Tyr Leu Asp Ser Phe Ala Val Trp Arg Arg Ser Arg 
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1565 1570 



1575 



GGA CTG CCC GCC CTA TCG CTC OGC TCC GGC CCG TOG GAC GGC OGC GGC 

?c^« L * U Pr ° A1 * Val Ser 01 y ^ G1 V Pro T^P Asp Gly Gly Gly 
1580 1585 159 o Y Y 1S ^ S 

ATG GCC AGC GGA CTC GGC GGA ACC GAC ACG GCC CGG CTC CGG CGC AGC 
Met Ala Ser Gly Leu Gly Gly Thr Asp Thr Ala Arg Leu Arg Arg Ser 

1600 1605 " 1610 

GGC ATC GCA CCC CTC AGC CGC GCC GAG GGC CTC GCC GCC CTC GAC CCG 
Gly He Ala Pro Leu Ser Arg Ala Glu Gly Leu Ala Ala Leu Asp Ala 
1615 1620 1625 

GCC CTC CCG GCC GGC GGC GAC GAC ACC GCG CCG GCC CAC CTC CTC CCG 
Ala Leu Ala Ala Gly Gly Asp Asp Thr Ala Pro Ala His Leu Leu Pro 
1630 1635 1640 

ATC CGC CTC CAC GCC GTG ACC CTG CGC GGC GCC GAC ACC CTC CCC GCC 
He Arg v a i Asp Ala Val Thr Leu Qly AU ^ ^ ^ 

1645 1650 1655 

GTC CTC CGC GAC CTC GCG GGA ACC GCG CCA AGC GCC GCC GAA CGG CCC 
Val Leu Arg Asp Leu Ala Gly Thr Ala Pro Ser Ala Ala Glu Arg Pro 
1660 1665 1670 1675 

CCC GCG ACA CCG GAG GAC ACG AAC GCG CCC CTC GCG CAC CTC ACC CAA 
Pro Gly Thr Pro Glu Asp Thr Asn Ala Pro Leu Ala Asp Val Thr Gin 
I 680 168S 1690 

CTC CAC GGC CGG GAA CGG AAG GAG GCA CTG ACC GGC TTC CTA CGC CCC 
Leu His Gly Arg Glu Arg Lys Glu Ala Leu Thr Gly Phe Val Ara Ala 
I 695 1700 1705 

^ f, 1 ? ?? C 00(3 CTC GGC CAC CCC ACG TCC GAC ACG ATC GAC CTC 

Gin Val Ala Ala Val Leu Gly His Pro Thr Ser Asp Thr lie Asp Val 
1710 1715 1720 

CGC CGG AGC TTC AAG GAA GCG GGC TTC GAC TCC. CTC ACC GCC GTC GAA 

9 tS?c * Phe LyS G1U M * Gly Phe *** Ser Leu Thr Ala Val Glu 
1,25 1730 1735 

CTC CGC AAC CGG CTC CGC GCC GCC ACC GCG CTC AAG CTC CCC GCC ACG 
Leu Arg Asn Arg Leu Arg Ala Ala Thr Gly Leu Lys Leu Pro Ala Thr 
1/4U 1 ? 45 1750 1755 

CTC GTC TTC GAC CAC CCG ACC CCC CTC GCG CTC GCC GGC TTC CTC CAC 
Leu Val Phe Asp His Pro Thr Pro Leu Ala Leu Ala Gly Phe Leu His 
1760 1765 i77 0 

CGC GAA CTC CCC GGC GCC GAA GCC TCC CTG ATC AGC GCG ATC GAC ACT 
Arg Glu Leu Pro Gly Ala Glu Ala Ser Leu Met Ser Ala lie Asp Thr 
1 ?75 1780 i7 8 5 

CTC CCG CAC CGG CTC CGC GAC GCC CTC GCC GAC GAC GCC GCA GAC CAC 
Leu Arg His Arg Leu Arg Asp Ala Leu Ala Asp Asp Ala Ala Asp Sp 
1790 1795 1800 

GCC CTG CGC GAC CAG ATC ACC CGA CGA CTC GAG ACC CTC CTC GCC GCC 
Ala Leu Arg Asp Gin lie Thr Arg Arg Leu Glu Thr Leu Leu Ala Cly 
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ATA GCC COG ACC GAG GAG CCC GCG CCC GCC ACC GCC GCC GCC GAC GAC 

™ Ar9 Thr G1U Glu Pro Ala Pro Ala Ala Al* Ala Asp Asp 
1820 1825 1830 18 35 

CCC AGC CCA CCC GGT GAT GTC GCG GAA CGA CTG AGC ACC GCG TCG GAC 
Gly Ser Gly Ala Gly Asp Val Ala Glu Arg Leu Ser Thr Ala Ser Asp 
1840 1845 1850 

GAC GAA CTG TTC GAA CTG CTC GAC AGC GGT TTC ACA CCC TGA CCGGCCGGCG 
Asp Glu Leu Phe Glu Leu Leu Asp Ser Gly Phe Thr Pro * 

1855 I860 1865 

GACCGGCACC ACGACATCGA GGCGACCGCA CCGAACACCC CCAGGGGGAA AGCT 

GTG TCC ACC GAG AAC TCC ACC AAC GTC CCG GCG AGC GAG 
Met Ser Thr Glu Asn Ser Thr Asn Val Pro Ala Ser Glu 
1 5 10 

GAC AAG CTC CGC GCC TAT CTG CGT CGC GCG ATC GCC GAC CTC CAC GAG 
Asp Lys Leu Arg Ala Tyr Leu Arg Arg Ala Met Ala Asp Leu His Glu 
" 20 25 

^ G f 7^ CGC GCC ACG GAA GCG CGC «* CAG GAG CCG ATC 

Ser Arg Glu Arg Leu Arg Ala Thr Glu Ala Arg Ala Gin Glu Pro lie 
JU 35 40 45 

GCG GTC GTC GGT ATG GGG TCC CGG TTC CCC GGT GGG GTC GGT TCG CCG 
Ala Val Val Gly Met Gly Cys Arg Phe Pro Gly Gly Val Gly Ser Pro 
50 55 60 

GAG GCG TTC TCG CGG TTC GTC GTC GAG GGG GTC GAC GCG GTT TCC CCG 
Glu Ala Leu Trp Arg Leu Val Val Glu Gly Val Asp Ala Val Ser Pro 
65 70 75 

TTT CCC GGT GAT CGT GGC TCG GAT GTC GAG GGG TTC TAC GAC CCG GAG 
Phe Pro Gly Asp Arg Gly Trp Asp Val Glu Gly Leu Tyr Asp Pro Glu 
80 85 go 

CCG GGT GTC GCG GGG AAG TCG TAT GTC CGG GAG GGG GGT TTT CTC CAT 
Pro o y Val Ala Gly Lys Ser Tyr Val Arg Glu Gly Gly p£ 2J Sis 

Asn ^ S° °f ™ GAT GA ° TTC TTC GGG ATT TCG CCG CGT GAG 

Asp Ala Ala Glu Phe Asp Ala Glu Phe Phe Gly lie Ser Pro Arg Glu 
1 115 120 125 

M G SJ? CCG CAG CAG CGG CTC TTC CTC GAG ACC TCC TCG 

Ala Val Ala Met Asp Pro Gin Gin Arg Leu Leu Leu Glu Thr Ser Trp 
130 135 140 

GAG GCG ATC GAG CGG GCG GGT ATC GAC CCG CAT TCG CTG CAC GGC AGC 
Glu Ala lie Glu Arg Ala Gly lie Asp Pro His Ser lZ Sis £y s£ 

150 

f GC ^ ^ GTC TAC 000 000 GTG ATC TAC CAC GAC TAT GGC ACG GGA 
Arg Thr Gly Val Tyr Ala Gly Val Met Tyr His Asp Tyr Gly {£ SJ 

165 170 
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CAG ACC TCC GCG ACC GAC ACG AGC GGT TAT TCC GGC ACC GGT ACG TCG 20576 
Gin Thr Ser Ala Thr Asp Thr Ser Gly Tyr Ser Gly Thr Gly Thr Ser 
175 180 185 

GGG AGT GTG GTG TCG GGG CGT GTG GCC TAC ACG CTG GGG CTG GAG GGT 20624 
Gly Ser Val Val Ser Gly Arg Val Ala Tyr Thr Leu Gly Leu Glu Gly 
1*0 195 200 205 

CCG GCC GTG ACC GTG GAT ACG GCG TGT TCG TCG TCG TTG GTG GCG TTG 20672 
Pro Ala Val Thr Val Asp Thr Ala Cys Ser Ser Ser Leu Val Ala Leu 
210 215 220 

CAT CTG GCG GTG CAG GCG TTG CGG GGT GGC GAG TGT GAC ATG GCG TTG 20720 
His Leu Ala Val Gin Ala Leu Arg Gly Gly Glu Cys Asp Met Ala Leu 
225 230 * 235 

GCC GGT GGT GTG ACG GTG ATG GCC GGG CCG GGG ATG TTC GTG GAG TTT 20768 
Ala Gly Gly Val Thr Val Met Ala Gly Pro Gly Met Phe Val Glu Phe 
240 245 250 

TCG CGG CAG CGG GGG TTG GCG GCC GAT GGG CGG TGC AAG GCG TTC GCG 20816 
Ser Arg Gin Arg Gly Leu Ala Ala Asp Gly Arg Cys Lys Ala Phe Ala 
255 260 265 



GAT GGG GCG GAT GGG ACC GCT TGG GCC GAG GGT GCG GGG GTG GTG CTG 20864 
Asp Gly Ala Asp Gly Thr Ala Trp Ala Glu Gly Ala Gly Val Val Leu 
25 270 275 280 285 

GTG GAG CGG TTG TCG GAT GCC CGG CGG TTG GGG CAT CCG GTG TTG GCG 20912 

Val Glu Arg Leu Ser Asp Ala Arg Arg Leu Gly His Pro Val Leu Ala 
290 295 300 

30 

GTG GTG TGT GGG TCG GCG GTG AAT CAG GAC GGT GCG TCG AAT GGT TTG 20960 

Val Val Cys Gly Ser Ala Val Asn Gin Asp Gly Ala Ser Asn Gly Leu 

305 310 315 

^ ACG GCG CCG AGT GGT CCG TCG CAG GAG CGG GTG ATT CGT CAG GCG TTG 21008 

Thr Ala Pro Ser Gly Pro Ser Gin Glu Arg Val lie Arg Gin Ala Leu 
320 325 330 

GCG AAT GCG CGG TTG ACG GTG GCG GAT GTG GAT GTG GTG GAG GCG CAT 21056 
Ala Asn Ala Arg Leu Thr Val Ala Asp Val Asp Val Val Glu Ala His 
40 335 340 345 

GGG ACG GGG ACG CGG CTG GGT GAT CCG ATC GAG GCG CAG GCG TTG CTG 21104 
Gly Thr Gly Thr Arg Leu Gly Asp Pro He Glu Ala Gin Ala Leu Leu 
350 355 360 365 

45 

GGG ACG TAT GGG CGG GAT CGT GAT GGT GGG CGT CCG GTG TGG TTG GGG 21152 
Gly Thr Tyr Gly Arg Asp Arg Asp Gly Gly Arg Pro Val Trp Leu Gly 
370 375 380 

TCG TTG AAG TCG AAT ATT GGT CAT GCT CAG GCG GCT GCG GGG GTG GCT 21200 
Ser Leu Lys Ser Asn He Gly His Ala Gin Ala Ala Ala Gly Val Ala 
385 390 395 

GGT GTG ATC AAG ATG GTG TTG GCG ATG CGG TAT GGG TGG TTG CCG CGG 21248 
Gly Val He Lys Met Val Leu Ala Met Arg Tyr Gly Trp Leu Pro Arg 
55 400 405 " 410 
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ACG TTG CAT GTG CAT GAG CCG AGC CGG CAT GTG GAC TGG TOG GCT GCT 21296 
Thr Leu Hxs Val Asp Glu Pro Ser Arg His Val Asp Trp Ser All Gly 
415 420 425 

^Tf^^T 10 CTG ACC GAG GCG CGG CAG TGG CCG GGG GTG GAC CGG 21344 
Gly Val Trp Leu Leu Thr Glu Ala Arg Clu Trp Pro Gly Val Asp Arg 
430 435 440 445 

CCG CGT CGG GCG GCG GTC TCC GCC TTT GOT GTC ACT GGT ACC AAC GCC 
Pro Arg Arg Ala Ala Val Ser Ala Phe Gly Val Ser Gly Thr Asn Ala 
450 455 460 

CAT CTG ATC CTC GAA GCC CCC GAC ACC GCC GAG GCG GAG AGC GCC ACG 
His Leu He Leu Glu Ala Pro Asp Thr Ala Glu Ala Glu Ser Ala Thr 
465 470 475 

ACC CCG GTC CGC TCT GAG GTG TCG GAG TCT GCT GCG GTC TTC GAT GCC 
Thr Pro Val Arg Ser Glu Val Ser Glu Ser Ala Ala Val Phe Asp Ala 
480 485 490 



20 CGC AGT GGT GTG GTG CCG GTG GTG GTT TCG GGG CGT TCG CGG GTG GTG 

Arg Ser Gly Val Val Pro Val Val Val Ser Gly Arg Ser Arg Val Val 
495 500 505 

vl? T° ?f GCG GGC CGG TTG GCG GAG GTG GTG GAG GCC GGT GGT 

25 9 * A1 * Gly LeU Ala Glu Val v «l Glu Ala Gly Gly 

515 520 525 

GTG GGG CTG GCG GAT GTG GCG GTG ACG ATG GCG GGC CGG TCG CGG TTT 
Val Gly Leu Ala Asp Val Ala Val Thr Met Ala Gly Arg Se^ lf g ™ 
530 535 " 540 

GGG TAT CGG GCG GTT GTG CTG GCT CGG GGT GAG GCT GAG CTT GCC GGG 
Gly Tyr Arg Ala Val Val Leu Ala Arg Gly Glu Ala Glu Leu Ala G^ 
545 550 555 



v^.« ^ r max UCG GAC GCG GGT GTG GTC ACG 

Arg Leu Arg Ala Leu Ala Gly Gly Asp Pro Asp Ala Gly Val Val Thr 
i6 ° 565 



650 



21392 



21440 



21488 



21536 



21584 



21632 



21680 



™ GCG TTG GCG GGG GGT GAT CCG GAC GCG GGT GTG GTC ACG 

Gly 
570 

?*? ^ GAG ACG GGG TCC GGT GGT GGG GGG GTG GTG 

Gly Ala Val Val Asp Pro Glu Thr Gly Ser Gly Gly Gly Gly Val Val 

TTG GTT TTC CCT GGT CAG GGG ACG CAG TGG GTG GGG ATG GGT GCG GGC 
Leu Val Phe Pro Gly Gin Gly Thr Gin Trp Val Gly 2? SJ S Gly 
595 600 60S 

^ ^ ^° TCG GAG GTG TTT GCG GCG TCG ATG CGG GAG TCT GCG 

Leu Leu Gly Ser Ser Clu Val Phe Ala Ala Ser Met Arg Glu Cys Ala 
610 615 620 

Art 5™ f T fTT CAT GTG GGG TGG GAT TTG CTG GAG GTG GTG TCG 21920 

Arg Ala Leu Ser Val His Val Gly Trp Asp Leu Leu Glu Val Val Ser 
625 630 635 

G^ S° 7° %° CGG GTC GAT GTC GTC GAG CCG GTG ACG TGG 
Gly Gly Ala Gly Leu Glu Arg Val Asp Val Val Gin Pro Val Thr Trp 
55 o % v 645 ^ 
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10 



15 



^ w"? ^ ^ CTG GCC CGG TAC 100 CAG GCG ATC GGT CTG GAC 

Ala Val Met Val Ser Leu Ala Arg Tyr Trp Gin Ala Met Gly Val Asp 

SI? ST S? 8 S"? °T T C ^ T TCC CAG 0(30 GAG ATC GCT GCC ACG 22064 
Val Ala Ala Val Val Gly His Ser Gin Gly Glu lie Ala Ala Ala Thr 

670 67 5 680 68S 

GTG GCG GGG GCG TTG TCG CTG GAG GAT GCG GCG GCT GTG GTC GCT CTG 22112 
Val Ala Gly Ala Leu Ser Leu Glu Asp Ala Ala Ala Val Val Ala Leu 
690 695 700 

CGG GCG GGG TTG ATT GGC CGG TAT CTG GCG GGT CGT GGT GCG ATG GCG 22160 
Arg Ala Gly Leu lie Gly Arg Tyr Leu Ala Gly Arg Gly Ala Met Ala 
7 °5 710 715 

GCT GTT CCG CTG CCT GCC GGC GAG GTC GAG GCC GGG CTG GCG AAG TCG 

7^n Pr ° Ala Gly G1U Val Glu Ala G1 y Leu A l* Lys Trp 

720 725 730 

20 i° G 5? T GT ? GAG GTC GCG GCG GTC AAC GGT CCG GCG TCT ACG GTG GTT 

Pro Gly Val Glu Val Ala Ala Val Asn Gly Pro Ala Ser Thr Val Val 
735 740 745 

21 f T ?* GTG 000 ^ TAT GTC GCC G ^ TGT CAG GCG 

^ Ser Gly Asp Arg Arg Ala Val Ala Gly Tyr Val Ala Val Cys Gin Ala 

/i0 755 760 765 

Glu S3 S? rf ff C0 ° 1,1X3 ATA CCG 010 GAC TAC «* TCT CAC 
Glu Gly Val Gin Ala Arg Leu He Pro Val Asp Tyr Ala Ser His Ser 

770 775 780 

Aro 21 £? S° f° ^ GAG 1X0 GAG CGG GTC CTG TCC GGT 

Arg His Val Glu Asp Leu Lys Gly Glu Leu Glu Arg Val Leu Ser Gly 

785 790 795 

35 jTf GGC p GG f C f T G CG CGG GTG CCG GTG TGT TCC ACC GTC GCC GGA 22448 

9 «™ ^ 9 Pr ° ** 9 Val Pro Val Ser Thr Val Ala Gly 

800 8 05 810 

GAG CAG CCG GGC GAG CCG GTT TTC GAT GCG GGG TAT TCG TTC CGT AAT 
Glu G n Pro Gly Glu Pro Val Phe Asp Ala Gly Tyr Txp Z Arg JS 

°20 325 

^ ^ GTT GAG ^ TCC GCG GTC GTC GGT GGT TTG TTG GAG 

Leu Arg Asn Arg Val Glu Phe Ser Ala Val Val Gly Gly Leu 2 2! 

835 840 845 

GAG GGC CAC CGT CGG TTC ATC GAG GTC AGT GCC CAC CCG GTA CTC GTT 
Glu Gly Hxs Arg Arg Phe lie Glu Val Ser Ala His Pro Val ITu 5a7 
850 955 860 

50 r7 GAG CAG AC ° 000 GAG GCC GCG G AC CGG AGT GTC CAT GCC 22640 

Hxs Ala lie Glu Gin Thr Ala Glu Ala Ala Asp Arg Ser VaT His S 
865 870 875 

£ ^ £° ^ ~ C 000 ° AG GAC GAC AGC CCG CAC CGC CTG CTG ACC 22688 
Thr Gly Thr Leu Arg Arg Gin Asp Asp Ser Pro His Arg Leu Leu ihr 

55 880 885 890 
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TCC ACC CCC GAG CCC TGG GCC CAC OGC GCC ACC CTC ACC TGG GAC CCC 
Ser Thr Ala Glu Ala Trp Ala His Gly Ala Thr Leu Thr Trp Asp Pro 
tt 5 ** 900 90s 

GCC CTC CCC CCA GGC CAC CTC ACC ACC CTC CCC ACC TAC CCC TTC AAC 
Ala Leu Pro Pro Gly His Leu Thr Thr Leu Pro Thr Tyr Pro Phe Asn 
910 915 920 925 

CAC CAC ^AC TAC TGG CTC GAC ACC ACC CCC ACC ACC CCC GCG ACG ACC 
His His His Tyr Trp Leu Asp Thr Thr Pro Thr Thr Pro Ala Thr Thr 
930 935 940 

ACC CAG AGC CCC ACC GAT GCC CAG AAC CCC GCC GAC GCC CTT CCC TAC 
Thr Gin Ser Pro Thr Asp Ala Gin Asn Pro Ala Asp Ala Leu Pro Tyr 
945 950 955 

AAG GTC ACT TGG AAG CGG TTG CGG GAC CAG GAC AGC TTG ACC GCG CGC 
Lys Val Ser Trp Lys Arg Leu Arg Asp Gin Asp Ser Leu Thr Ala Arg 
960 965 970 

CTC GAC GGC CGA TGG CTG CTC GTG GTA CCG GAG GCG TCC GCG GAC CCG 
Leu Asp Gly Arg Trp Leu Leu Val Val Pro Glu Ala Ser Ala Asp Pro 
975 gao 985 

TCG CTT GCT GAG GGC GTC GCG CGC GAG CTG ACC GCG CGG GGC GCG ACC 
Ser Val Ala Glu Gly Val Ala Arg Glu Leu Thr Ala Arg Gly Ala T~£ 
990 995 1000 1005 

vIY 3° G CTG ACG GTC GAG CCG «* GCC GAC CGT TCG CGG CTG CGC 

Val Glu Ser Leu Thr Val Glu Pro Gly Ala Asp Arg Ser Arg Leu Arg 
1010 1015 102 0 

GGG CTG CTG GTC GAC GCC ACG GAA CGG GAC GAA GCC GGG CCG CTG CGC 
Gly Leu Leu Val Asp Ala Thr Glu Arg Asp Glu Ala Gly Pro LeS Arg 
1025 1030 1035 

ctJ t£ T G f TC CTG GCG TTG GCC GGG GAC CAC GCC GGG GCC GAC 

7 * ^ LeU AU LeU Ala G1 * Aa ? His A1 * Gly Ala Asp 

1040 1045 1050 

GGG GCA CGC CCG GTG GTT CCG GCC GGC CTG GCA GCC TCA CTG GCG CTC 
Gly Ala Arg Pro Val Val Pro Ala Gly Leu Ala Ala S S S 2S 
10b5 1060 1065 

JK Gin S° ^ GAC GCG 030 ACG GAG GCC GGG CTG TGG GCG GTC 
lo70 ^ A ^c A1 * Gly Thr G1U Ala G ^ Leu Trp Ala Val 

1075 1080 " 1085 

ACC CGC GGC GCC GTC GCG GCC GTC CCC GGT GAC GTA CCG GCG CCG TCG 
Thr Arg Gly Ala Val Ala Ala Val Pro Gly Asp Val Pro Sa £ £r 
1090 1095 uoo 

Sn Al"a ^ CTC TGG GGA TTC GGC CGG GTC GCC GGG ATC GAG CTC CCG 
Gin Ala Leu Leu Trp Gly Phe Gly Arg Val Ala Gly lie Glu Leu Pro 
1105 1110 ins 

CAC TCC TGG GGC GGC CTC CTC GAC CTC CCG ACC GGG CCC GGC GAC TCC 
His Cys Tr^Gly Gly Leu Leu Asp^Leu Pro Thr Gly S SJ Asp 2r 
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GGG TTC CGG CAA CTG GCC GCG ACC CTG GCG GGC CGT CCC GCG GAG GAC 
Gly Phe Arg Gin Leu Ala Ala Thr Leu Ala Gly Arg Pro Ala Glu Asp 
1135 1140 1145 



23456 



CAG GTG GCG CTG CGG GCT TCG GGC GCG TAC GGC CGC AGG CTG GTC CGG 
Gin Val Ala Leu Arg Ala Ser Gly Ala Tyr Gly Arg Arg Leu Val Arg 
1150 1155 1160 1165 



23504 



GCC TCC GCG GCG GGC GGC GCG GAC GGC TGG CGG CCG CGG GGA ACG GTG 
10 Ala Ser Ala Ala Gly Gly Ala Asp Gly Trp Arg Pro Arg Gly Thr Val 

1170 1175 1180 



23552 



75 



CTG GTG GTG GGC GAC ACC GCG GAG GTC GCC GGA CCG CTG GTG CGC TGG 23600 
Leu Val Val Gly Asp Thr Ala Glu Val Ala Gly Pro Leu Val Arg Trp 
1185 1190 1195 

CTG CTC GGC AAC GGC GCA CGG CGG GTG ACC CTG TCC GGA CTC TCC GGC 23648 
Leu Leu Gly Asn Gly Ala Arg Arg Val Thr Leu Ser Gly Leu Ser Gly 
1200 1205 1210 



20 CCG CTG CCG GAG GAA CTC GCC GAT GTG GCG GCA CGG GTG ACC GTG GCG 

Pro Leu Pro Glu Glu Leu Ala Asp Val Ala Ala Arg Val Thr Val Ala 
1215 1220 1225 



23696 



25 



CCC TGT GAT CCG GCC GAT CGC CCC GCC CTG CGG ACG CTG CTC GCC GAA 
Pro Cys Asp Pro Ala Asp Arg Pro Ala Leu Arg Thr Leu Leu Ala Glu 
1230 1235 1240 1245 



23744 



30 



CAG GCG CCG ACC GCC GTG CTC GTG GCG CCC CCG GCC GTC CCG CCC ACG 23792 
Gin Ala Pro Thr Ala Val Leu Val Ala Pro Pro Ala Val Pro Pro Thr 
1250 1255 1260 

CCG CTC GCG GAG ATG ACC GCC GAG GCG TTA GCC ATC GCG CTG TCC GCG 23840 
Pro Leu Ala Glu Met Thr Ala Glu Ala Leu Ala lie Ala Leu Ser Ala 
1265 1270 1275 



35 



AAG ACC GGT CTG GTC GAC CGT CTG GAC TCG CTG CTC GAC GAG CCG GAC 
Lys Thr Gly Leu Val Asp Arg Leu Asp Ser Leu Leu Asp Glu Pro Asp 
1280 1285 1290 



23888 



40 



CCC CTG CTC GAG GAC GGG GAA CTC GAC GCG TTC GTC GTC TTC TCC TCC 
Pro Leu Leu Glu Asp Gly Glu Leu Asp Ala Phe Val Val Phe Ser Ser 
1295 1300 1305 



23936 



45 



GTG GCA GGG GTG TGG GGC GGC GCG GGA CAG GGT GGT TAC GCG GCC GGT 23984 
Val Ala Gly Val Trp Gly Gly Ala Gly Gin Gly Gly Tyr Ala Ala Gly 
1310 1315 1320 1325 

ACC GCG TAC CTC GAC GCG CTC GCC GAA TGC CGG CGG GCC GGG GGG CTG 24032 
Thr Ala Tyr Leu Asp Ala Leu Ala Glu Cys Arg Arg Ala Gly Gly Leu 
1330 1335 1340 



50 



CCG GTC ACC TCG GTG GCG TGG ACG CCG TGG CTC GGT ACG CCG GCG GCG 
Pro Val Thr Ser Val Ala Trp Thr Pro Trp Leu Gly Thr Pro Ala Ala 
1345 1350 1355 



24080 



55 



GAC TCC CTG GGC GAG CAG ATG AGC CGA GCT GGC ATC ACC CCC CTG GAT 
Asp Ser Leu Gly Glu Gin Met Ser Arg Ala Gly He Thr Pro Leu Asp 
1360 1365 1370 
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CCG GCG GCC TCG CTG GAT GCG CTC GCC CGT GCG GTC GGC CGG CGC GCG 

Pr ° f^L Ala Ser L6U Ala Leu Ala Ala Val G1 Y Arg Arg Ala 
13 '5 1380 1385 

GGC TGT GTG ACG GTC GCC GAC ATC GAC TGG GAG CGG TTC GCC TCC GCG 
Gly Cys Val Thr Val Ala Asp lie Asp Trp Glu Arg Phe Ala Ser Ala 
1390 1395 1400 1405 

TAC ACG GCC ACC CGT CCC ACG CCG ATG TTC GAC GAG GTG CCC GAG GTC 
Tyr Thr Ala Thr Arg Pro Thr Pro Met Phe Asp Glu Val Pro Glu Val 
1410 1415 1420 

CGG' CGG ATA CAG GCC GCG TCG GCG GAA GCG GAG GCC GAC GCC GCG CGC 
Arg Arg lie Gin Ala Ala Trp Ala Glu Ala Glu Ala Asp Ala Ala Arg 
1425 1430 1435 

AGC GGT GCC GGC GGC GAC TCG CAG CTG CTG CGC TCC CTC CGG GGC CGG 
Ser Gly Ala Gly Gly Asp Ser Gin Leu Leu Arg Ser Leu Arg Gly Ara 
1440 1445 1450 y 

CCC GAG GAG GCC CAA CTG GCG GAG CTG CTG CGG CTG GTG CGC ACC CAT 
Pro Glu Glu Ala Gin Leu Ala Glu Leu Leu Arg Leu Val Arg Thr His 
!455 1460 i4 6 5 

GCC GCC GCG GTG CTC GGC CTG GGC TCG CCC GGC GCG GTG GAG GCG CGG 

V,™ Val ** u Gly LeU Gly Ser Pro G1 V Ala Val Glu Ala Arg 

1470 1475 148O 1485 

CGT TCG TTC AAG GAC CTG GGC TTC AAC TCG GTG ACG GCG GTG GAG CTG 
Arg Ser Phe Lys Asp Leu Gly Phe Asn Ser Val Thr Ala Val Glu Leu 
1490 1495 1500 

CGG AAC CGG CTC AAG GAG GCG ACG GGA CTC CGG CTC GAG GTC TCC CTC 
Arg Asn Arg Leu Lys Glu Ala Thr Gly Leu Arg Leu Glu Val Ser Leu 
15 °5 1510 1515 

GTC TTC GAC CAC CCG GAC CCG GCC TCC CTC GCC CGG CAT CTC CTC GAT 
Val Phe Asp His Pro Asp Pro Ala Ser Leu Ala Arg His Leu Leu Asp 
1520 1525 1530 

CTC GCC CTC GGC CAG GAG CCG GAG GAG ACG CCG CGG GCG TTC GCG CTC 
Leu Ala Leu Gly Gin Glu Pro Glu Glu Thr Pro Arg Ala Phe Ala Leu 
lb35 1540 1545 

GAA CCC GCG CCG AAC GGG GAG CCG ATC GCG ATC GTC TCC ATC GCC TCC 
Glu Pro Ala Pro Asn Gly Glu Pro lie Ala lie Val Ser Met Ala Cys 
1550 1555 1560 1565 

CGT ATC CCG GGG GGT GTC AGC ACG CCC GAG GAG CTC TCG CGG CTC CTC 
Arg Met Pro Gly Gly Val Ser Thr Pro Glu Glu Leu Trp Arg Leu Le^ 
1570 1575 isso 

T° ^ f*° GAC GCG ATC 000 CCG CCC SCC AAC CGG GGC TCG 

Arg Asp Gly Lys Asp Ala lie Gly Pro Phe Pro Ala Asn Arg Gly Trp 
158 5 1590 1595 

GAC CTC GAG AAC CTC TAC GAC CCC GAC CCG GAC GCC GAC GGC CGC ACC 
Asp Leu Glu Asn Leu Tyr Asp Pro Asp Pro Asp Ala Asp Gly £g Jhr 
1600 1605 i 6 io 
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TAT GTC CGC GAG GGC GGA TTC CTC CAC GAG GCA CCG GAC TTC GAC CCC 
Tyr Val Arg Glu Gly Gly Phe Leu His Glu Ala Pro Asp Phe Asp Pro 
1615 1620 1625 

TCG TTC TTC GGC ATC TCG CCG CGC GAG GCG CTG GCG ATG GAC CCG CAG 
Ser Phe Phe Gly lie Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gin 
1630 1635 1640 1645 

CAG CGG CTG CTG CTG GAG ACT TCC TGG GAG GCC TTC GAG CGC GCC GGC 
Gin Arg Leu Leu Leu Glu Thr Ser Trp Glu Ala Leu Glu Arg Ala Gly 
1650 1655 1660 

ATC GAC CCG GCG AGG CTG CGC GGC AGC CGT ACC GGC GTC TTC GTC GGG 
lie Asp Pro Ala Arg Leu Arg Gly Ser Arg Thr Gly Val Phe Val Gly 
1665 1670 1675 

ACG AAC GGC CAG CAC TAC ATG CCG CTG CTC CAG AAC GGC GGG GAC AGC 
Thr Asn Gly Gin His Tyr Met Pro Leu Leu Gin Asn Gly Gly Asp Ser 
I 680 1685 1690 

TTC GAC GGC TAT CTG GGC ACC GGC AAC TCG GCG ACT GTG ATG TCG GGC 
Phe Asp Gly Tyr Leu Gly Thr Gly Asn Ser Ala Ser Val Met Ser Gly 
1695 1700 i7 05 y 

CGG CTG TCG TAC GTG TTC GGC CTC GAA GGC CCC GCC GTG ACC GTG GAC 
Arg Leu Ser Tyr Val Phe Gly Leu Glu Gly Pro Ala Val Thr Val Asp 
1710 1715 1720 1725 

ACC GCC TGC TCG GCG TCC CTG GTG GCG CTG CAC CTG GCG GTG CAG GCG 
Thr Ala Cys Ser Ala Ser Leu Val Ala Leu His Leu Ala Val Gin Ala 
1730 1735 1740 

ATC CGG CGC GGC GAG TGC GAC ATC GCG CTC GTC GGC GGC GCG ACG GTC 
Met Arg Arg Gly Glu Cys Asp Met Ala Leu Val Gly Gly Ala Thr Val 
1745 1750 " 1755 

ATC TCG ACG CCC GAG ATC CTG GTC GAG TTC TCC CGG CAG CGG GTC ATC 
Met Ser Thr Pro Glu Met Leu Val Glu Phe Ser Arg Gin Arg Val lie 
17 60 1765 177 0 

^ f C f 00 AGG 000 TO GCC «* GCC GAC GGT GTG 

Ser A ** Asn ^ Ser ^9 Ala Phe Ala Ala Gly Ala Asp Gly Val 
1775 1780 1785 

GCG CTC GGC GAG GGC GTC GGC GTC CTC CTG GTG GAG CGG CTG TCG GAC 
Ala Leu Gly Glu Gly Val Gly Val Leu Leu Val Glu Arg Leu Ser Asp 
1790 1795 1800 1805 

GCC GAG CGC AAC GGC CAT CCG GTC CTC GCG GTG GTC CGC GGC TCG GCG 
Ala Glu Arg Asn Gly His Pro Val Leu Ala Val Val Arg Gly Ser Ala 
1810 1815 1820 

^ C %° ? AC 000 000 TCC AAC 000 CT G ACG GCG CCC AAC GGG CCC 
Val Asn Gin Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro 
1825 1830 18 3s 

™ C r ^ CAG CGG GTC ATC CGG CAG GCG CTG GCG GAC GCC GGG CTG CGG 
Ser Gin Gin Arg Val lie Arg Gin Ala Leu Ala Asp Ala Gly Leu Arg 
I 840 1845 1850 
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CCC GAG GAC ATC GAC GCC GTC GAG GCG CAC GGC ACC GGC ACC GAG CTG 
Pro Glu Asp He Asp Ala Val Glu Ala His Gly Thr Gly Thr Glu Leu 
1855 i860 1865 

GGC GAC CCC ATC GAG GCC GAG GCG CTG CTC GCC ACC TAT GGA AGG ACC 
Gly Asp Pro He Glu Ala Glu Ala Leu Leu Ala Thr Tyr Gly Arg Thr 
1870 1875 1880 1885 

CGT ACG GCG GAC CGC CCG CTG TGG CTC GGC TCC CTG AAG TCC AAC ATC 
Arg Thr Ala Asp Arg Pro Leu Trp Leu Gly Ser Leu Lys Ser Asn He 
1890 1895 1900 

GGG CAC ACC CAG GCC GCC GCC GGC GTG GCG GGC GTC ATC AAG ATG GTG 
Gly His Thr Gin Ala Ala Ala Gly Val Ala Gly Val He Lys Met Val 
1905 1910 1915 

CTC GCG CTG GGC AAC GAG ACA CTG CCG CGC ACC CTG CAC GTG GAT GAG 
Leu Ala Leu Gly Asn Glu Thr Leu Pro Arg Thr Leu His Val Asp Glu 
1920 1925 1930 

CCC ACA CCG CGC GTG GAC TGG TCC TCT GGC GCG GTC TCC CTG CTC ACC 
Pro Thr Pro Arg Val Asp Trp Ser Ser Gly Ala Val Ser Leu Leu Thr 
1935 1940 1945 

GAG CCG GTG GAC TGG CCC GCC GGC CCG TCC GCG CCG CGC CGT GCG GCC 
Glu Pro Val Asp Trp Pro Ala Gly Pro Ser Ala Pro Arg Arg Ala Ala 
1950 1955 i960 1965 

GTG TCC TCG TTC GGC ATC AGC GGC ACC AAC GCC CAC ACG ATC CTG GAG 
Val Ser Ser Phe Gly He Ser Gly Thr Asn Ala His Thr He Leu Glu 
I 970 1975 198O 

CAG GCC CCC GTC CCC GCG GAG TCC CGC CCC GGG ACG GAG CCG GCG GAC 
Gin Ala Pro Val Pro Ala Glu Ser Arg Pro Gly Thr Glu Pro Ala Asp 
1985 1990 1995 

GGC ACG GGC GCG TGG GAG AAC GTG ACC GTT CCG CTG CTG CTG TCC GGC 
Gly Thr Gly Ala Trp Glu Asn Val Thr Val Pro Leu Leu Leu Ser Gly 
2000 2005 2010 

CAC ACC GAG GCG GCG CTG CGC GAG CAG AGC ACG AGG CTG CTG AAC GAC 
His Thr Glu Ala Ala Leu Arg Glu Gin Ser Thr Arg Leu Leu Asn Asp 
2015 2020 2025 

CTG CTG GAG CAC CCG GAC GAG CAC CCG GCC GAC GTC GGC TAC ACC CTG 
Leu Leu Glu His Pro Asp Glu His Pro Ala Asp Val Gly Tyr Thr Leu 
2030 2035 2040 2045 

ATC ACC GGC AGG GCC CAC TTC GGG CAC CGG GCC GCC GTG ATC GGC GAG 
He Thr Gly Arg Ala His Phe Gly His Arg Ala Ala Val lie Gly Glu 
2050 2055 2060 

AGC CGG GAA GAA CTG CTC GAC GCC CTG AAG GCT CTC GCC GAG GGC CGC 
Ser Arg Glu Glu Leu Leu Asp Ala Leu Lys Ala Leu Ala Glu Gly Arg 
2065 2070 2075 

GAG CAC CAC ACC GTG GTA CGG GGC GAC GGG ACG GCC CAC CCG GAC CGG 
Glu His His Thr Val Val Arg Gly Asp Gly Thr Ala His Pro Asp Ara 
2080 2085 2090 
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CGC GTG GTC TTC GTC TTC CCC GGG CAG GGC TCG CAG TGG CCG TCG ATC 
Arg Val Val Phe Val Phe Pro Gly Gin Gly Ser Gin Trp Pro Ser Met 
2095 2100 2105 



26336 



GCC CGG GAC CTG CTC GAC CGC GCG CCC GCC TTC CGC GAG ACG GCG AAG 
Ala Arg Asp Leu Leu Asp Arg Ala Pro Ala Phe Arg Glu Thr Ala Lys 
2110 2115 2120 2125 



26384 



GCC TGC GAC GCC GCG CTG AGC GTC CAT CTG GAC TGG TCC GTG CTC GAT 
10 Ala Cys Asp Ala Ala Leu Ser Val His Leu Asp Trp Ser Val Leu Asp 

2130 2135 2140 



26432 



15 



GTC CTC CAG GAG AAG CCG GAC GCG CCG CCG CTG AGC CGG GTC GAC GTG 26480 

Val Leu Gin Glu Lys Pro Asp Ala Pro Pro Leu Ser Arg Val Asp Val 
2145 2150 2155 

GTG CAG CCC GTG CTG TTC ACG ATG ATG CTG TCG CTC GCC GCC TGC TGG 26528 

Val Gin Pro Val Leu Phe Thr Met Met Leu Ser Leu Ala Ala Cys Trp 
2160 2165 2170 



20 CGG GAC CTC GGC GTC CAC CCG GCC GCC GTG GTG GGC CAC TCC CAG GGA 

Arg Asp Leu Gly Val His Pro Ala Ala Val Val Gly His Ser Gin Gly 
2175 2180 2185 
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GAG ATC GCG GCG GCC TGC GTG GCC GGC GCG CTC TCC CTG GAG GAC GCG 
Glu lie Ala Ala Ala Cys Val Ala Gly Ala Leu Ser Leu Glu Asp Ala 
2190 2195 2200 2205 



26624 



GCG CGG ATC GTG GCG CTG CGC AGC CGG GCA TGG CTC ACA CTG GCC GGC 26672 
Ala Arg He Val Ala Leu Arg Ser Arg Ala Trp Leu Thr Leu Ala Gly 
2210 2215 2220 

AAG GGC GGC ATG GCC GCC GTC TCC CTG CCG GAA GCC CGG CTG CGC GAG 26720 
Lys Gly Gly Met Ala Ala Val Ser Leu Pro Glu Ala Arg Leu Arg Glu 
2225 2230 2235 

CGG ATC GAG CGG TTC GGG CAG CGG CTG TCG GTG GCC GCG GTG AAC AGC 26768 
Arg He Glu Arg Phe Gly Gin Arg Leu Ser Val Ala Ala Val Asn Ser 
2240 2245 2250 

CCG GGC ACG GCG GCG GTC GCC GGT GAC GTG GAC GCG CTG CGG GAA CTG 26816 
Pro Gly Thr Ala Ala Val Ala Gly Asp Val Asp Ala Leu Arg Glu Leu 
2255 2260 2265 

CTG GCG GAG CTG ACC GCG GAG GGC ATC CGG GCC AAG CCG ATC CCC GGC 26864 
Leu Ala Glu Leu Thr Ala Glu Gly He Arg Ala Lys Pro He Pro Gly 
2270 2275 2280 2285 

GTG GAC ACG GCC GGC CAC TCC GCG CAG GTG GAC GGC CTG AAG GAG CAT 26912 
Val Asp Thr Ala Gly His Ser Ala Gin Val Asp Gly Leu Lys Glu His 
2290 2295 2300 

CTC TTC GAG GTG CTG GCG CCG GTC TCC CCG CGC TCC TCG GAC ATC CCG 26960 
Leu Phe Glu Val Leu Ala Pro Val Ser Pro Arg Ser Ser Asp He Pro 
2305 2310 2315 

TTC TAC TCG ACG GTG ACG GGC GCG CCG CTG GAC ACC GAG CGG CTG GAC 27008 
Phe Tyr Ser Thr Val Thr Gly Ala Pro Leu Asp Thr Glu Arg Leu Asp 
2320 2325 2330 
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GCC CGG TAC TGG TAC CGC AAC ATG CGG GAG CCC GTG GAG TTC GAG AAG 
Ala Gly Tyr Trp Tyr Arg Asn Met Arg Gl« Pro Val Glu p£ Gil £s 
2335 2340 2345 

GCC GTC AGG GCA CTG ATC GCC GAC GGC TAC GAC CTG TTC CTG GAG TGC 
Ala Val Arg Ala Leu lie Ala Asp Gly Tyr Asp Leu Phe Leu Glu cS 
2355 2360 2 365 

AAC CCG CAC CCG ATG CTC GCC ATG TCG CTG GAC GAG ACA CTC ACC GAC 
Asn Pro His Pro Met Leu Ala Met Ser Leu Asp Glu £ Su t£ Sp 
2370 2375 2380 

AGC GGC GGC CAC GGC ACC GTG ATG CAC ACC CTC CGC CGG CAG AAG GGC 
Ser Gly Gly His Gly Thr Val Met His Thr Leu Arg 2£ Cln £s 
2385 2390 2395 

ACC GCC AAG GAC TTC GGC ATG GCG CTC TGC CTC GCC TAT GTC AAC GGA 

£oo ASP G1V MSt Ala LeU Leu Ala IVr Val Asn SJ 
^«uo 2405 



2410 



22 au, f ° ^ A GAA CTC TTC GGC CCC GAC TCA CGC CGG GTG 

Leu Glu lie Asp Gly Glu Ala Leu Phe Gly Pro Asp Ser Arg Arg Val 
■** 1S 2 *20 2425 

AAC CCG CCG ACQ TAC CCG TTC CAG CGG GAG CGC TAC TGG TAC CAC CCC 
Ajn Pro Pro Thr Tyr Pro Phe Gin Arg Glu Arg Tyr Trp £ Hi* Pro 
Z4Ji 2440 2445 

ACG AGC GGC AGG CGC GGC GAC ATC ACG GCG GCC GGC GTG GCC GAG GCG 
Thr Ser Gly Arg Arg Gly Asp He Thr Ala Ala Gly Sa! Sa S! Sa 
2450 2455 2460 

Glu Ss 2 f™ 7* ^ 000 GTC GAA CTC CCG GAG ACC GGG GGC 

Glu His Pro Leu Leu Gly Ala Gly Val Glu Leu Pro Glu Thr Gly Gly 
2465 2470 ^ * * 

SS tl° GCG CGG TTC CGC CCG GAC AGC CGG CCG TGG CTG GCC 

Thr Val Tyr Thr Ala Arg Phe Gly Pro Asp Ser Arg Pro Trp Leu Sa 
" !4B0 2 «85 2490 

GAC CAC GCG CTG CTG GGC ACC GTG CTG CTG CCC GGC ACG GCA ATC CTG 
Asp Hxs Ala Leu Leu Gly Thr Val Leu Leu Pro Gly Thr Ala iTe ITu 
" 2500 2505 

GAC CTG GTG CTG TGG GCG GGC GAA CGC TCC GGC TGC GGC CGC GTC GTT 
AspLeu Val Leu Trp Ala Gly Glu Arg Ser Gly Cys Gly Arg Val £y 
<iSl! » 2520 2525 

GAA CTG GCG CTC CAG GCA CCG CTG GTC CTG CCG GAC AGC GGC GAC GTC 
Glu Leu Ala Leu Gin Ala Pro Leu Val Leu Pro Asp Ser Gly Asp Val 
2530 "SS 2540 

GAA CTG CGG CTG CTG GTC GGC GGC CCG GAC GAG GAG AAA CGG CGC ACC 
Gl« Leu Arg Leu Leu Val Gly Gly Pro Asp Glu Glu Lys £J S £ 
^ 4S > 2550 2555 

i£r 52 Sis IT T° o CC ?? ^ GCC 000 000 GAG CCG TGG 

2«o 9 Pr ° A1<> A1 * Ala G1 * Ala Glu A** Pro Trp 

^ 60 2 565 2570 
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ACC CGG CAC GCC GAA GCC GTG GTC CTG CCC GCC ACC GGC GAG GAG CCG 
Thr Arg His Ala Glu Ala Val Val Leu Pro Ala Thr Gly Glu Glu Pro 
2575 2580 2585 

ACC CCC GCC CCG CGC CCC GTC CCC GAG CCG GCG GGC ACC ACG GAC CCC 
Thr Pro Ala Pro Arg Pro Val Pro Glu Pro Ala Gly Thr Thr Asp Pro 
2590 2595 2600 2605 

GCC GCG TTC TAC GCG GAG TTC GCC GAG CGC GGC TAC GAC TAC GGC CCG 
Ala Ala Phe Tyr Ala Glu Phe Ala Glu Arg Gly Tyr Asp Tyr Gly Pro 
2610 2615 2620 

GCC TTC CAG GGC TTC ACC GCC GGA GCG CGC CAC GGC GAG GAC GTC GTC 
Ala Phe Gin Gly Phe Thr Ala Gly Ala Arg His Gly Glu Asp Val Val 
2625 2630 2635 

GCC GAG GTC GCG CTC CCC AGC GGC CTC GTC GCG GAC GCC CGT CAC CAC 
Ala Glu Val Ala Leu Pro Ser Gly Leu Val Ala Asp Ala Arg His His 
2640 2645 2650 

CGG CTC CAC CCG GCG CTG CTC GAC GCC GCG CTC CAG GCG ATC ATC CTC 
Arg Leu His Pro Ala Leu Leu Asp Ala Ala Leu Gin Ala Met He Leu 
2655 2660 2 665 

GGC ACG TTC TTT GCC GAC GAC GGC CGC GCC CGG ATC CCG TTC GCG GTC 
Gly Thr Phe Phe Ala Asp Asp Gly Arg Ala Arg Met Pro Phe Ala Val 
2670 2675 2 680 2685 

CGC GGA GTA CGG CTC CAC ACG GCC GGC GCC GAC CGG CTC CGC GTC CTG 
Arg Gly Val Arg Leu His Thr Ala Gly Ala Asp Arg Leu Arg Val Leu 
2690 2695 2700 

ATC TCC CCG GCG GGC GAC GAG ACC GTA CGG CTC CTC TCC ACC GAC CTC 
He Ser Pro Ala Gly Asp Glu Thr Val Arg Leu Leu Cys Thr Asp Leu 
2705 2710 2715 

GCG ACC GGC GCC CCC GTC CTG GAG ATC GAC GAA CTC GTC GTC CGC CCG 
Ala Thr Gly Ala Pro Val Leu Glu He Asp Glu Leu Val Val Arg Pro 
2720 2725 2730 

GTC TCC GGC GAG CAG TTC GCG GCC GGC GCC CCG GGC CGC AAC GGC GGC 

™ Gly G1U Gln Leu Ala Ala Ala Pro Gl y Arg Asn Gly Gly 
2735 2740 2745 

GAG CTC TAC CGG GTC GAC TGG ACG GTC CTG CCG GAG CCC GCC GAG GTC 
Glu Leu Tyr Arg Val Asp Trp Thr Val Leu Pro Glu Pro Ala Glu Val 
2750 2755 2 760 276 5 

CCC GCG CCG CGC TGG GCC CTC CTC GGC GAG GAC CAC GCC GGC CTC GCC 
Pro Ala Pro Arg Trp Ala Leu Leu Gly Glu Asp His Ala Gly Leu Ala 
2770 2775 2780 

GAT GTC CTC GGA GGG ACG GGC GGC GGC TCC GAG CGG TAC GAC ACC CTC 
Asp Val Leu Gly Gly Thr Gly Gly Gly Cys Glu Arg Tyr Asp Thr Leu 
2785 2790 2795 

ACC GGC CTC CTC GAG GCC ACC ACC CGG TCG GCC GGC GGA ATC CTC CCC 
Thr Gly Leu Leu Glu Ala Thr Thr Arg Ser Ala Gly Gly He Leu Pro 
2800 2805 2810 



27776 



27824 



27872 



27920 



27968 



28016 



28064 



28112 



28160 



28208 



28256 



28304 



28352 



28400 



28448 
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GAC ATC GTC GCC CTC TCC TTG CCC ACC GCC CCG GAG CCC GCC CCC CAG 
Asp lie Val Ala Leu Ser Leu Pro Thr Ala Pro Glu Pro Gly Pro Gin 
2815 2820 2825 

GCG GTC CGC GAG GTG CTG TCC CAG GCG CTC GAC GCC GCC CAG GCG TGG 

™ Val *** G1U Val Leu Ser Gln A1 * Leu As P AU Ala Gin Ala Trp 
2830 2835 2 840 2 84S 

CTG GCC GCC GGC GCC GAG ACC GCC TCC GCC CGG CTG GTG TTC GTC ACC 
Leu Ala Ala Gly Ala Glu Thr Ala Ser Ala Arg Leu Val Phe Val Thr 
2850 2855 2860 

GGC GGC GCG GTG GCC ACC ACG GCG GAC GAA ACC GTG CGC GAC ATC GCG 
Gly Gly Ala Val Ala Thr Thr Ala Asp Glu Thr Val Arg Asp He Ala 
2865 2870 2 875 

GCG GCC GCC GTC TGG GGC CTG GTC CGC TCG GCG CAG TCC GAG GAA CCC 
Ala Ala Ala Val Trp Gly Leu Val Arg Ser Ala Gin Ser Glu Glu Pro 
2880 2885 2 890 

GAC CGC ATG GTC CTG CTC GAC CTC GAC GGC GAG CGG CCC ACC GCG CGG 

P A »L Met Val LCU Leu Asp Leu G1 * G1 « Arg Pro Thr Ala Arg 
2895 2900 2905 

ACG CTG GCG GCG GCG CTC GCG TCC GGC GAA CCG CAA CTC GCC GTG CGC 
Thr Leu Ala Ala Ala Leu Ala Ser Gly Glu Pro Gin Leu Ala Val Arg 
2910 2915 2920 2925 

GGC TCC ACG GTC GCC GCT CCC CGG CTC GCC CCG GCC GGG CCC GGC CCG 
Gly Ser Thr Val Ala Ala Pro Arg Leu Ala Pro Ala Gly Pro Gly Pro 
2930 2935 2940 

GAG GAC CTC GTA CCG CCC GCC GGC ACC ACC GCC TGG CGG CTC ACC CCC 
Glu Asp Leu Val Pro Pro Ala Gly Thr Thr Ala Trp Arg Leu Thr Pro 
2945 2950 2 955 

GGC GGG GGG ACG CTC CAG GAA CTC TCG CTC GCG CCC GCC CCC GAC GCG 
Gly Gly Gly Thr Leu Glu Glu Leu Ser Leu Ala Pro Ala Pro Asp Ala 
2960 2965 2970 

GAG GAA CCA CTC GCA CCG GGC CAG GTA CGC ATC CCC GTC CGC GCG GCG 
Glu Glu Pro Leu Ala Pro Gly Gin Val Arg lie Ala Val Arg Ala Ala 
2975 2980 2985 

GGC GTC AAC TTC CGC GAC GCC CTC ATC GCC CTC GGC ATC TAC CCG GGC 
Gly Val Asn Phe Arg Asp Ala Leu lie Ala Leu G^ 2 £r Pro Gly 
2990 2995 3000 3005 

AAG GGA ACC ATC GGC GCC GAG GGA GCC GGC GTC GTC GTC GAG ACC GCC 
Lys Gly Thr Met Gly Ala Glu Gly Ala Gly Val Val Val Glu Thr Ala 
3010 3015 3020 

Pro So S*? if 7™ TCC GCC GGA GAC CGC GTC CTC GGC ATC TGG 

Pro Asp Val Thr Gly Leu Ser Ala Gly Asp Arg Val Leu Gly Met Trp 

3025 3030 3035 

™ 5? C ?f ™ 0«5 CCC CTC GTC GTC GCC GAC CAC CGC ATC GTG GCC 
Asn Gly Gly Phe Gly Pro Leu Val Val Ala Asp His Arg Met Val Ala 
J040 3045 3050 



28496 



28544 



28592 



28640 



28688 
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28832 



28880 



28928 



28976 



29024 



29072 



29120 



29168 
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CCG ATC CCC CAC CGC TCG TCG TAC GCC GAG GCG GCC TCC GTG CCC GCC 29216 
Pro lie Pro His Gly Trp Ser Tyr Ala Glu Ala Ala Ser Val Pro Ala 
3055 3060 3065 

S GTC CTC CTC ACC TCC TAC TAC GCG CTC ACC CGG CTG GCC CGG GCC CGC 29264 

Y^L LeU LeU Thr Ser 1Vr Tyr Ala Leu Thr Arg Leu Ala Ala Arg 
3070 3 075 3080 3085 

ACC GGA CAG ACC GTC CTC GTC CAC GCC GCC GCC GGC GGT GTC GGC ATG 29312 
10 Thr Gly Gin Thr Val Leu Val His Ala Ala Ala Gly Gly Val Gly Met 

3 <>90 3095 3100 

GCG ACC CTC CAA CTC GCC CGC CAC CTC GGC CTG GAG GTG TAC GCC ACC 29360 
Ala Thr Leu Gin Leu Ala Arg His Leu Gly Leu Glu Val Tyr Ala Thr 
f5 3105 3 U0 3115 

GCG AGC ACC GGC AAA TGG GAC GCC CTC CAG AAG CAC GGC ATC CCC GAC 29408 
Ala Ser Thr Gly Lys Trp Asp Ala Leu Gin Lys His Gly He Pro Asp 
3120 3125 3130 

20 GAC CGC ATC GCC GAC TCC CGC ACC CTC GAC TTC GCC GAG CGC TTC CTC 29456 

Asp ^ * le Ala Asp Ser Leu Asp Phe Ala Glu Arg Phe Leu 

3135 3140 3145 

TCC CGG ACG GGC GGC CGG GGT GTC GAC ATC GTG CTG AAC TCC CTG GCC 29504 

25 ffJln** 9 Gly Gly Gly Val ASP He Val Leu Asn Ser Leu Ala 

3150 3155 316O 3i 6 5 

GGC GAG TTC GTC GAC GCC TCA CTG CGG CTG CTG CCG CGC GGC GGG CAC 
Gly Glu Phe Val Asp Ala Ser Leu Arg Leu Leu Pro Arg Gly Gly His 
3170 3175 3180 

TTC CTC GAA CTC GGC AAG GCC GAC GTC CGC GAC CCC CGG CGG ATC GCC 
Phe Leu Glu Leu Gly Lys Ala Asp Val Arg Asp Pro Arg Arg lie Ala 
3185 3190 3195 

GCC GCC CAT CCG GGC ACC GAC TAC CGG GCG TTC GAC CTC GTG CAG GCC 29648 
Ala Ala His Pro Gly Thr Asp Tyr Arg Ala Phe Asp Leu Val Gin Ala 
3200 3205 3210 



30 



35 



GGT CCC GAC ACC GTC GGG GAG ATG CTC GGG GAA CTC CTG GAA CTG TTC 
Gly Pro Asp Thr Val Gly Glu Met Leu Gly Glu Leu Leu Glu Leu Phe 
40 32 15 3220 3225 



45 



29552 



29600 



29696 



29792 



S a? ^ f?° CTG 0(30 CCG CTG CCG CTC ACC GCC TAC GGC ATA CGC 29744 
Ala Ala Gly Ala Leu Arg Pro Leu Pro Leu Thr Ala Tyr Gly lie Arg 
3230 3 235 3240 324 5 

GAC GCC CGC ACC GCC TTG CGC ACC CTC ACC CAG GCC CGG CAC ACC GGC 
Asp Ala Arg Thr Ala Leu Arg Thr Leu Ser Gin Ala Arg His Thr Gly 
32 50 3255 326O 

SO ^ 01X3 ACG GTC CCT GCC ^A TTC GAC ACC CAC CGC ACG GTG 29840 

» Lys Leu Val Leu Thr Val Pro Ala Gly Phe Asp Thr His Arg Thr Val 

3265 3270 3275 

CTC CTC ACC GGC GGC ACG GGC ACG CTC GGC CAG ACA CTC GCC CGC CAT 29888 
Leu Leu Thr Gly Gly Thr Gly Thr Leu Gly Gin Thr Leu Ala Ar£ His 
55 3 2 80 3285 3290 
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CTG GTC AAC CGC CAC GGC GTA COG CAC CTG CTG CTC GCC GGC CGC ACC 
Leu Val Asn Arg His Gly Val Arg His Leu Leu Leu Ala Gly Arg Thr 
J " b 3300 3305 

M * ^ ^ GGC G TC GCG GAA CTC ATT GGT GAA CTG GGC GAG 

Gly Ala Ala Ala Glu Gly Val Ala Glu Leu He Gly Glu Leu Gly Glu 
3310 3315 3320 3325 

7*° ^ ff* 2^ GTC CGG GTC GCG GCC TGC GAC GCC GCC GAC CGG CAG 
Leu Gly Ala Glu Val Arg Val Ala Ala Cys Asp Ala Ala Asp Arg Gin 
3330 3335 3340 

CGG CTC ACC GAA CTC CTC GCC GGA ATC CCC GTC GAG CAC CCG CTC GGC 
Arg Leu Thr Glu Leu Leu Ala Gly lie Pro Val Glu His Pro Leu Gly 
3345 3350 3355 

S SS Sal Hil Sf £° « JfC GAC GAC GGC ACC ATC CCC TCA 

Ala Val Val His Ala Ala Gly Thr Leu Asp Asp Gly Thr He Pro Ser 
3360 3365 3370 

CTG ACC GGC GAG AAC ATC GAC AAC GTG CTG CGG CCC AAG GCC GAC GCC 
Leu Thr Gly Glu Asa He Asp Asn Val Leu Arg Pro £s Ma £p Sa 
3375 3380 3385 

GTC CTC AAC CTG CAC GAG CTG ACC CGC GAC GCC GAC CTC TCG GCG TTC 
Val Leu Asn Leu His Glu Leu Thr Arg Asp Ala Asp Leu Ser Sa £2 
3395 3400 3405 

GTC CTC TAC TCG TCC TCC TCG GCG CTG CTC GGC AGC CCC GGC CAG GGT 
Val Leu Tyr Ser Ser Ser Ser Ala Leu Leu Gly Ser Pro oTy Ol n Gly 
3410 3415 3420 

Ma £r Ma £° ^ ^ ^ ^ GAC GGC TTC GCC CGA TAC CGC 

Ala Tyr Ala Ala Ala Asn Ala Phe Leu Asp Gly Phe Ala Arg Tyr Arg 
3425 3430 3435 

£s alt r° ° CG GCG CTC TCG CTC GCC TGG GGA CTG TGG GGC 

V S?n V Ma LeU Set LCU Ala Tr P G ly ^u Trp Gly 

* 40 3445 3450 

AGC AAC AGC CGC ATG GCG GGC CAC CTC GAC CAG TCG GGC ATC CAA CCG 
Ser Asn Ser Arg Met Ala Gly His Leu Asp Gin Ser Gly S g£ 

3460 3465 * 

CGC CTG AAC CGG AGC GGC ATC ATG GCG CTC ACC GAC GCC GAG GGC CTC 
Ar^Leu Asn Arg Ser Gly lie Met Ala Leu Thr Asp £a ofu Gly iTu 
3475 3480 34 85 

S f" 3 3^ GA ° ^ CAG GAC GGC GGG GAC GCG CTG CTG GTG CCG 

Ala Leu Phe Asp Ala Ala Gin Asp Gly Gly Asp Ala Leu Leu Val Pro 
3490 3495 3 5 oo 

ATC CGG CTC AAC CGG ACG GCC CTT CGC GCC TCG GGA CGG ATC ACC CCG 
Met Arg Leu Asn Arg Thr Ala Leu Arg Ala Ser Gly £ S Tnr Pro 
3505 3510 3515 

TTC CTC AGC GGC TTG GCC GGC GGC GGG CCG GCG GCG GGG GAG AGG CGC 
Ph. Leu Se^Gly Leu Ala Gly oiyCly Pro Ala Ala SyoE £ S 



29936 



29984 



30032 



30080 



30128 



30176 



30224 



30272 



30320 



30368 



30416 



30464 



30512 
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30608 
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CCC GAG GTG GCA GCC GTA TCC GGG ACA CTC GCG GAA CGG CTG ACC GGG 
Pro Glu Val Ala Ala Val Ser Gly ftr Leu Ala Glu £ {£ S 
■"JS 3540 3545 

CTC ACG GCA CAG GAA GGG CAC GCC CTC GTC CTG GCC GAG ATC CCT err 
Leu Thr Ala Gin Glu Gly His Ala Leu Val Leu £a gJu SgS 
3555 3560 3565 

S^^f^P 3 ^ CAC ^ TCC GAC GAC TCG ATC CCC GAG 
His Ala Ala Ala Val Leu Gly His Gly Ser Asp Asp Ser He Pro Glu 

3570 3575 3 58 o 

GAC CGG GCC TTC AAG GAC CTC CGC TTC GAC TCG CTC ACC GCC GTG GAG 
Asp Arg Ala Phe Lys Asp Leu Gly Phe Asp Ser Leu Thr Ala Val Glu 
3585 3590 3595 

ATC CGC AAC CGG CTG AGC GCG GCC ACC GGC CTC CGG CTG CCC GCC ACC 

9 LSU Ala Ala Thr G1 * Leu ^u Pro Ala Thr 

3500 3605 3610 

CTC GTC TTC GAC CAC CCG ACC CCG GGC GAG CTG GCC GGC CAC CTC ACT 
Leu Val Phe Asp His Pro Thr Pro Gly Glu Leu Ala Sy His L^ Ser 
blb 3620 3 625 

OCT GAA CTG TCC GCC GAC GAT GCC CCG GGC AGC GCC TCC CCG CTT ACC 

36*30 " A ^ C ASP Ala Pr ° Gly Ser Ala S ~ P~ 2! ihr 

3635 3640 3 6 45 

GAA CTC GAC CGT TTC GAA GCC CTG TTC ACC GCT CTC GCA CCG GGG ACC 
Glu Leu Asp Arg Phe Glu Ala Leu Phe Thr Ala Leu Ala Pro G^ ihr 
3650 3655 366O 

ACC AAG GAC ACC CCG GGC GGG GCC GGG GCA CTG ATC ATC GAC GAG GCC 
Thr Lys Asp Thr Pro Gly Gly Ala Gly Ala Leu Met 55 Asp ctu Ma 
3665 3670 3 675 

° AG GGC GAG ATC GCC GGG CGG CTC GCG GCG CTC GCC GGT CTC TCG 

Glu Arg Gl^Glu lie Ala Gly Ar^Leu Ala Ala Leu Ala 2S *rrp 

AAC CGG CTC CAC GGC ACC ACG ACG GCT CCT GAG GAC GGC GAC ACC GTC 
Asn Arg Leu His Gly Thr Thr Thr Ala Pro Glu Asp Gly Asp iS SS 

3700 3705 

GCG GAC GCC CTG GAA GCC GCG GAC GAC CAC GAG ATC TTC GCA TTC CTC 
Ala Asp Ala Leu Glu Ala Ala Asp Asp His Glu He Phe Sa p£ 2S 
3715 3720 3725 

2p Su S ™ GCCCGCCCCA <*GACAGCAC AGGTCAAAAC AC ATC GCC 

3730 J et Ala 

??° ^ CAG CAA CTC CGT GCC TAT CT« AAG CGA GCG ACG ACC 

Asn Ala Asn Glu Gin Gin Leu Arg Ala Tyr Leu Lys Arg Sa Jhr ihr 
5 10 15 

GAA CTC GAC ^ GT ACC TCC GAA CAA CTC AGG GAG GAG CGG GCA CGG GCC 
Glu Leu His Arg Thr Ser Glu Gin Leu Arg Glu Glu Arg Ma Arg Sa 

25 30 
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CAC GAG CCC ATC GCC GTC GTC GGC ATG GCC TGC CGC TAC CCC GGA GGC 31381 

His Glu Pro lie A la Val Val Gly Met Ala Cys Arg Tyr Pro fly S£ 3 " 81 
40 45 5 o 

GCG AAC ACC CCC GAA CAG TTC TGG GAA CTG CTC GAC ACC GGC ACC car moo 

Ala Asn Thr Pro Glu Gin Phe Trp Glu Leu lTu Asp £ Jnr Asp 31429 
55 60 65 



Tyr Asp Pro Asp Pro Ala Ala Ala oTy Arg Tor £ Cys Arg Glu Gly 

90 95 

GGC TTC CTC CAC GAC GCG GGC GAC TTC GAC GCG GAC TTC TTC GGG ATT 
Gly Phe Leu Hxs Asp Ala Gly Asp Phe Asp Ala Asp p£ J£ 

105 110 

20 TCG CCG CGT GAG GCG GTG GCG ATG GAT CCG CAG CAG CGG CTG TTG CTG 

Ser Pro Arg Glu Ala Val Ala Met Asp Pro Gin Gin Art 2^ ^ Z 
120 125 130 

135 140 14 § r 

CTC CGC GGC AGC CGC ACC GGG GTG TAC GTG GGC GCC TGG GAC AGC GGC 
Leu Arg Gly Ser Arg Thr Gly Val Tyr Val Gly Ala J£ Sp sfr fly 

155 160 

TAC ACC GGC CAG GCG CAC GCG CCC TCG GCC GAG TTG GAG GCC GAC nr 
Tyr Thr Gly Gin Ala His Ala Pro Ser Ala Glu IS Olu £a Asp £u 

170 175 



2JS 220 " 225 



cS So f «f ^ GCC GGT GGT GTG ACG GTG ATG GCG ACC CCG GCG 
Cys Asp Leu Ala Leu Ala Gly Cly Val Thr Val Met Ala Thr pS aS 

230 23 5 240 



TGC AAG GCG TTC GCC GAC GCC GCC GAC GGC TTC GGC CCC GCC ran 

„ 01,5 & «■ - % «• «, ™ s s s s: sj 



31477 



Sa Sa S Pro J"? ?* ^ 00 *-GG GGA TGG GAC ACC CAC GGG CTG 
Ala Ala Ala Pro Met Pro Ser Asp Arg Gly Trp Asp Thr His Gly Leu 

7 5 80 
£r So S 2£ S2 ^ fCG GGG CGC ACC TAC TGC CGG GAG GGC 31525 



31573 
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CTG GGG ATG GTG CTG GTG GAG CGG TTG TCG GAT GCC CCG CGG TTG GGG 32101 
Val Gly Met Val Leu Val Glu Arg Leu Ser Asp Ala Arg Arg Leu Gly 
275 280 285 * " 290 

CAT CCG GTG TTG GCG GTG GTG TGT GGG TCG GCG GTG AAT CAG GAC GGT 32149 
His Pro Val Leu Ala Val Val Cys Gly Ser Ala Val Asn Gin Asp Gly 
295 300 305 

GCG TCG AAT GGT TTG ACG GCG CCG ACT GGT CCG TCG CAG GAG CGG GTG 32197 
Ala Ser Asn Gly Leu Thr Ala Pro Ser Gly Pro Ser Gin Glu Arg Val 
310 315 320 

ATT CGT CAG GCG TTG GGG AAT GCG CGG TTG ACG GTG GCG GAT GTG GAT 32245 
He Arg Gin Ala Leu Gly Asn Ala Arg Leu Thr Val Ala Asp Val Asp 
325 330 335 

GTG GTG GAG GCG CAT GGG ACG GGG ACG CGG CTG GGT GAT CCG ATC GAG 32293 
Val Val Glu Ala His Gly Thr Gly Thr Arg Leu Gly Asp Pro lie Glu 
340 345 350 

zo GCG CAG GCG TTG CTG GGG ACG TAT GGG CGG GAT CGT GAT GGT GGG CGT 32341 

Ala Gin Ala Leu Leu Gly Thr Tyr Gly Arg Asp Arg Asp Gly Gly Arg 
355 3 60 365 370 

CCG GTG TGG TTG GGG TCG TTG AAG TCG AAT ATT GGT CAT GCT CAG GCG 32389 
Pro Val Trp Leu Gly Ser Leu Lys Ser Asn He Gly His Ala Gin Ala 
575 380 385 

GCT GCG GGG GTG GCT GGT GTG ATC AAG ATG GTG TTG GCG ATG CGG TAT 32437 
Ala Ala Gly Val Ala Gly Val lie Lys Met Val Leu Ala Met Arg Tyr 
390 395 400 

30 

GGG TGG TTG CCG CGG ACG TTG CAT GTG GAT GAG CCG AGC CGG CAT GTG 32485 
Gly Trp Leu Pro Arg Thr Leu His Val Asp Glu Pro Ser Arg His Val 
405 410 415 

35 ? AC P° TCG 007 081 GGT GTG CGG TT° CTG ACC GAG GCG CGG GAG TGG 32533 

Asp Trp Ser Ala Gly Gly Val Arg Leu Leu Thr Glu Ala Arg Glu Trp 
42 0 425 430 

CCG GGG GTG GAC CGG CCG CGT CGG GCG GCG GTC TCC GCC TTT GGT GTC 
Pro Gly Val Asp Arg Pro Arg Arg Ala Ala Val Ser Ala Phe Gly Val 
40 435 440 445 450 

ACT GGT ACC AAC GCC CAT CTG ATC CTC GAA GCC CCC GAC ACC GCC GAG 
Ser Gly Thr Asn Ala His Leu lie Leu Glu Ala Pro Asp Thr Ala Glu 
455 460 465 



32581 



32629 



GCG GAG AGC GCC ACG ACC CCG GTC CGC TCT GAG GTG TCG GAG TCT GCT 32677 
Ala Glu Ser Ala Thr Thr Pro Val Arg Ser Glu Val Ser Glu Ser Ala 
4? 0 475 480 

GCC CTC CTC GAT GCC CGC ACT GGT GTG GTG CCG GTG GTG GTT TCG GGG 32725 
Ala Val ^ eu Ala Arg Ser Gly Val Val Pro Val Val Val Ser Gly 
485 490 495 * 

CGT TCG CGG GTG GTG GTG CGG GAG GCT GCG GGC CGG TTG GCG GAG GTG 32773 
Arg Ser Arg Val Val Val Arg Glu Ala Ala Gly Arg Leu Ala Glu Val 
55 500 505 510 
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32821 



CTC GAG GCC GCT CCT GTS GGG CTG GCG GAT GTG GCG GTG ACQ ATG GCG 
Val Glu Ala Gly Gly Val Gly Leu Ala Asp Val Ala Val Thr Met Ala 
515 520 525 530 

5 GGC CGG TCG CGG TTT GGG TAT CGG GCG GTT GTG CTG GCT CGG GGT GAG 32869 

Gly Arg Ser Arg Phe Gly Tyr Arg Ala Val Val Leu Ala Arg Gly Glu 
535 540 545 

GCT GAG CTT GCC GGG CGT TTG CGG GCG TTG GCG GGG GGT GAT CCC GAC 32917 
„ Ala Glu Leu Ala Gly Arg Leu Arg Ala Leu Ala Gly Gly A3 P Pro Asp 

550 555 560 



GCG GGT GTG GTC ACQ GGT GCG GTG GTG GAC CCC GAG ACG GGG TCC GGT 
Ala Gly Val Val Thr Gly Ala Val Val Asp Pro Glu Thr Gly Ser Gly 
565 570 575 y 



- — A v.^i w k.jvj uug ACG CAG TGG GTG 

Gly Gly Gly Val Val Leu Val Phe Pro Gly Gin Gly Thr Gin Trp Val 
580 585 



32965 



33013 



33061 



33109 



33157 



GGT GGG GGG GTC GTG TTG GTT TTC CCT GGT CAG GGG ACG CAG TGG GTG 

Gly 
590 

20 GGG ATG GGT GCG GGG CTG CTG GGG TCT TCG GAG GTG TTT GCG GCG TCG 

Gly Met Gly Ala Gly Leu Leu Gly Ser Ser Glu Val Phe Ala Ala Ser 
595 600 605 610 

ATC CGG GAG TGT GCG CGG GCG CTG AGT GTT CAT GTC GGG TGG GAT TTC 
Met Arg Glu Cys Ala Arg Ala Leu Ser Val His Val Gly Trp Asp Leu 
615 620 625 

2S rf v 1 ? TCG GGC GGG GCC GGG TTC GAG CGG GTC GAT GTC GTC 

Leu Glu Val Val Ser Gly Gly Ala Gly Leu Glu Arg Val Asp Val Val 
630 635 " 640 

CAG CCG GTC ACG TCG GCG GTC ATC GTC TCG CTG GCC CGG TAC TGG CAG 33205 
Gin Pro Val Thr Trp Ala Val Met Val Ser Leu Ala Arg Tyr Trp Gin 
64i 650 655 

GCG ATG GGT GTG GAC GTG GCT GCG GTG GTC GGT CAT TCC CAG GGG CAC 

Ala Met Gly Val Asp Val Ala Ala Val Val Gly His 2 £n Gly Su 33253 

S SI ST S° GTG GCG GCG TTC TCG CTG GAG GAT GCG GCG 

lie Ala Ala Ala Thr Val Ala Gly Ala Leu Ser Leu Glu Asp Ala Ala 
680 685 690 

GCT GTC GTC GCT CTG CGG GCG GGG TTG ATT GGC CGG TAT CTG GCG GGT 
Ala Val Val Ala Leu Arg Ala Gly Leu lie Gly Arg Tyr 22 Sa Sy 
695 700 7 05 



33301 



33349 



33397 



CGT GGT GCG ATC GCG GCT GTT CCG CTG CCT GCC GGC GAG GTC GAG GCC 
Arg Gly Ala Met Ala Ala Val Pro Leu Pro Ala Gly Glu Val Glu S 
710 715 ' 720 

SS^f^f ^ GTG GAG GTC «* GCG GTC AAC GGT CCG 33445 

so Gly Leu Ala Lys Trp Pro Gly Val Glu Val Ala Ala Val Asn Gly Pro 

725 730 735 

X 2r Jhr 2? ST r ^ GAT CGG CGG GCG GTG GCC GGT TAT GTG 
Ala Ser Thr Val Val Ser Gly Asp Arg Arg Ala Val Ala Gly Tyr Val 
55 /<lv 745 750 



33493 



S3 
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GCC GTC TGT CAC GCG GAG GGT GTG CAG GCT CGG TTG ATA CCG GTG GAC 33541 
Ala val Cys Gin Ala Glu Gly Val Gin Ala Arg Leu lie Pro Val Asp 

S TAC GCC TCT CAC TCC CGC CAT GTG GAG GAC CTG AAG GGC GAG TTG GAG 33 589 

Tyr Ala Ser His Ser Arg His Val Glu Asp Leu Lys Gly SS IS Glu 
775 780 7 85 

CGG GTG CTG TCC GGT ATC CGC CCC CGC ACT CCG CGG GTG CCG GTG TGT 33637 
Arg Val Leu Ser Gly Xle Arg Pro Arg Ser Pro Arg Val Pro Val Cys 
790 795 800 

£° ^ f,I? °f S? A 2*° CA ° CCG ^ GAG CCG GTT TTC GAT GCG GGG 33685 

" Phe 
815 



is 



25 



30 



45 



— — • — ™j uw. wu* On TTC GAT GCG GGG 

Ser Thr Val Ala Gly Glu Gin Pro Gly Glu Pro Val Phe Asp Ala Gly 
805 810 



TAT TGG TTC CGT AAT CTG CGG AAC CGG GTT GAG TTC TCC GCG GTG GTC 
Tyr Trp Phe Arg Asn Leu Arg Asn Arg Val Glu Phe Ser Ala Val Val 
820 825 830 



855 860 



865 



aS stl ST °f £ C °?° ACC CTC CGC CGC CAG GAC AGC CCG 

Arg Ser Val His Ala Thr Gly Thr Leu Arg Arg Gin Asp. Asp Ser Pro 
otv 875 



880 



CAC CGC CTG CTG ACC TCC ACC GCC GAG GCC TGG GCC CAC GGC GCC ACC 

9 S Leu Thr Ser Thr Ala Glu Ala Tr ? Ala His Sy S S 

eus 890 



910 



900 905 

ACC TAC CCC TTC AAC CAC CAC CAC TAC TGG CTC GAC ACC ACC CCC ACC 

Thr Tyr Pro Phe Asn His His His Tyr Trp Leu Asp ThJ Jhr Pro J£ 

40 313 920 925 930 

i£ £ G ACC CAG AGC CCC ACC GA T GCC TGG CGC TAC CGC 

Thr Pro Ala Thr Thr Thr Gin Ser Pro Thr Asp Ala Trp Arg Tyr Arg 

935 940 



945 



GTC ACC TGG AAA GCC CTG ACC GAA GAA TCC ACT CCG GCC TCG TCC CCC 
Val Thr Trp Lys Ala Leu Thr Glu Glu Ser Thr Pro Ala Ser Ser Pro 
950 955 960 



CTC GGG GAC CGG GCC GCC GGC GCC CTC GCA CGT CAG GGG GCC ACG GTG 
Leu Gly Asp Arg Ala Ala Gly Ala Leu Ala Arg Gin Gly Ala Thr vS 
55 yao 985 



33733 



20 GGT GGT TTG TTG GAG GAG GGC CAC CGT CGG TTC ATC GAG GTC ACT GCC 33781 

Gly Gly Leu Leu Glu Glu Gly His Arg Arg Phe He Glu Si SeT £a ' 
835 840 845 S50 

CAC CCG OTA CTC GTC CAT GCC ATT GAG CAG ACG GCC GAG GCC GCG GAC 33829 
Hxs Pro Val Leu Val His Ala lie Glu Gin Thr Ala Glu Ala A^ Asp 



33877 



33925 



895 

35 T CTC f£ C P° GAC CCC GCC CTG CCC C CA GGC CAC CTC ACC ACC CTC CCC 33973 

Leu Thr Trp Asp Pro Ala Leu Pro Pro Gly His Leu Thr Thr Le^ Pro ' 



34021 



34069 



34117 



TCC GGT CAC TGG CTC CTC GTC ACA CCC CCG ACC CCC GAA GGC CGC ACG 34165 
Ser Gly Hxs Trp Leu Leu Val Thr Pro Pro Thr Pro Glu Gly S £ 



34213 



990 
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GAA CGG CTG CTG GTC GAT CCG GTC GCC GTC GCA CGC GAC GGG CTC GCG 
Glu Arg Leu Val Val Asp Pro Val Ala Val Gly Arg Asp Gly Leu Ala 
995 1000 1005 1010 

GCG CGC CTG GGC GAA CGG TGG GAC GGT GTG CTG TCC CTG CTC GGC GCC 
Ala Arg Leu Gly Glu Arg Trp Asp Gly Val Leu Ser Leu Leu Gly Ala 
1015 1020 1025 

GAC GAG CGT CCG CTC CCA CGG CAT CCC GCC CTC AAC CGC GCC GTC ATG 
Asp Glu Arg Pro Leu Pro Arg His Pro Ala Leu Asn Arg Ala Val Met 
1030 1035 1040 

GGC ACC ACG CTG CTC GCC CAG GCC GCT CTG GAC GCA GGA TGC GAG GCG 
Gly Thr Thr Leu Leu Ala Gin Ala Ala Leu Asp Ala Gly Cys Glu Ala 
1045 1050 1055 

CGG ATA TGG GCC GTG ACG CGG GAG GCC GTC GCC GTC TCC CCG AGC GAG 
Arg He Trp Ala Val Thr Arg Glu Ala Val Ala Val Ser Pro Ser Glu 
1060 1065 1070 

GTG CCG CGG GAC GCC GGC GCG CAG CTC TGG GGG CTC GGG CGG GGC ATC 
Val Pro Arg Asp Ala Gly Ala Gin Leu Trp Gly Leu Gly Arg Gly He 
1075 1080 1085 1090 

GCG CTG GAA CAC CCC TCC CTC TGG GGC GGA TO ATC GAT CTG CCC GCC 
Ala Leu Glu His Pro Ser Leu Trp Gly Gly Leu He Asp Leu Pro Ala 
1095 iioo H05 

GTG CCG GAC GAA CGC GCG TGG GCC AGG GCC GTC CGG CGG CTC GTC CCG 
Val Pro Asp Glu Arg Ala Trp Ala Arg Ala Val Arg Arg Leu Val Pro 
HIO 1115 U20 

CAC GGT GAG GAC CAG ATC GCC GCG CGC GCC TCG GGT GCC TAT GGG CGC 
His Gly Glu Asp Gin He Ala Ala Arg Ala Ser Gly Ala Tyr Gly Arg 
H25 H30 1135 

AGG CTC CTG CCG GCT CCG CCG GCC GCG TCG CGC CGC ACC TCC ACA CCG 
Arg Leu Leu Pro Ala Pro Pro Ala Ala Ser Arg Arg Thr Cys Thr Pro 
H40 H45 H50 

TCC GGC ACG GTG CTG GTC ACC GGC GGT ACG GGA GCG CTC GGC GGT CAT 
Ser Gly Thr Val Leu Val Thr Gly Gly Thr Gly Ala Leu Gly Gly His 
1155 1160 H65 ' mo 

CTG GCC CGC CGT CTC GCA CGC GGC GGG ACC GGG CAT CTG GTG CTC ACC 
Leu Ala Arg Arg Leu Ala Arg Gly Gly Thr Gly His Leu Val Leu Thr 
II 75 H80 H85 

AGC CGT CGC GGC CCG GAC GCG CCG GGC GCC GGT GAA CTC GCC GGT GAA 
Ser Arg Arg Gly Pro Asp Ala Pro Gly Ala Gly Glu Leu Ala Gly Glu 
H 9 0 1195 1200 

CTC GCC TCC CTG GGC GCG AAG GTC ACG GTC GCC GCG TGC GAC ATC GCC 
Leu Ala Ser Leu Gly Ala Lys Val Thr Val Ala Ala Cys Asp Met Ala 
1205 1210 1215 

GAC CGT GAA GCC GTC CGG GCG CTG CTC GAC GAG CAC CGG CCG ACC GCG 
Asp Arg Glu Ala Val Arg Ala Leu Leu Asp Glu His Arg Pro Thr Ala 
1220 1225 1230 



34261 



34309 



34357 



34405 



34453 



34501 



34549 



34597 



34645 



34693 



34741 



34789 



34837 



34885 
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GTC TTC CAC ACG GCG GGC ACG CCC CAC TCG GOG GAG TTC ACG GCG CTC 
Val Phe His Thr Ala Gly Thr Pro His Ser Ala Glu Phe Thr Ala Leu 
1235 1240 1245 1250 

GAC GAG ACG ACG ACG GCC GGG GTG TAC GGC GGG AAG GTC CTG GGT GCC 
Asp Glu Thr Thr Thr Ala Gly Val Tyr Gly Gly Lys Val Leu Gly Ala 
1255 1260 1265 

CGG CAT CTG GAC GAA CTG ACC CGG GAA CTC GGC ATC GGG CTC GAC GCG 
Arg His Leu Asp Glu Leu Thr Arg Glu Leu Gly He Gly Leu Asp Ala 
1270 1275 1280 

TTC GTC CTC TTC TCC TCC GGC GCC GCG GTC TGG GGC AGC GGC GGC CAG 
Phe Val Leu Phe Ser Ser Gly Ala Ala Val Trp Gly Ser Gly Gly Gin 
1285 1290 1295 

ACC GCT TAC GGG GCC GCG AAC GCC GCG CTG GAC GCC CTC GCC GAG CGG 
Thr Ala Tyr Gly Ala Ala Asn Ala Ala Leu Asp Ala Leu Ala Glu Arq 
1300 1305 1310 * 

CGC CGT GCC GCC GGG CTC CCC GCG ACC TCC GTC GCC TCG GGC CTC TCG 
Arg Arg Ala Ala Gly Leu Pro Ala Thr Ser Val Ala Trp Gly Leu Trp 
1315 1320 1325 1330 

GGC GGC GGA GGC ATC GGG GAG GGG GAC GGT GAG GAG TTC CTC AGC CGG 
Gly Gly Gly Gly Met Gly Glu Gly Asp Gly Glu Glu Phe Leu Ser Arg 
1335 1340 1345 

CGC GGC CTC GGC GTC ATC CCG CCG GAG GAC GCG CTG GAA GCC CTC GAC 
Arg Gly Leu Gly Val Met Pro Pro Glu Asp Ala Leu Glu Ala Leu Asp 
1350 1355 1360 

CGG GCC CTC GAC CGG GAG GAC ACC ACC GTC GTC GTC GCG GAT GTC GAC 
Arg Ala Leu Asp Arg Glu Asp Thr Thr Val Val Val Ala Asp Val Asp 
1365 1370 1375 

TGG GAG CGG TTC GCC CCG GCC TTC ACC GCG TTC CGG CCC AGT GCG CTC 
Trp Glu Arg Phe Ala Pro Ala Phe Thr Ala Phe Arg Pro Ser Ala Leu 
1380 1385 1390 

ATC TCC CGG CTC GTC TCG GAC GGC GGG GAG GCG GGG GGG CAG GAC GCC 
lie Ser Arg Leu Val Ser Asp Gly Gly Glu Ala Gly Gly Gin Asp Ala 
1395 1400 1405 1410 

CCG GAC GGC ACG CTC TTC GCC GCC GGG TTC GCG GCC GCC GGG CCA CTC 
Pro Asp Gly Thr Leu Phe Ala Ala Gly Phe Ala Ala Ala Gly Pro Leu 
1415 1420 1425 

GAG CGG CAG GAG ATC CTC CTC GGC CTC GTC CGC CGG CAT GTC GCC GCC 
Glu Arg Gin Glu Met Leu Leu Gly Leu Val Arg Arg His Val Ala Ala 
1430 1435 144Q 

GTA CTC GGC CAC CCG GGG ACC GCG GAC ATC GGT CCC GAC CGT GCT TTC 
Val Leu Gly His Pro Gly Thr Ala Asp He Gly Pro Asp Arg Ala Phe 
1445 1450 1455 

AAG GAG CTC GGG TTC AGT TCG GTC ACC GCC GTC GAG CTC GCC GGG CGG 
Lys Glu Leu Gly Phe Ser Ser Val Thr Ala Val Glu Leu Ala Gly Ara 
1460 i4 6 5 1470 



34981 



35029 



35077 



35125 



35173 



35221 



35269 



35317 
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35653 



56 



EP0 791 655 A2 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



CTC GGC CGG GAG TGC GGA CGG AAG CTC CCG CCG ACG CTC GTC TTC GAC 
Leu Gly Arg Glu Cys Gly Arg Lys Leu Pro Pro Thr Leu Val Phe Asp 
1475 1480 1485 1490 

CAT CCG ACT GCC GCG GCG GCC GTC GAA CAC CTC GCG GAG CTC CTC ACA 
His Pro Thr Ala Ala Ala Ala Val Glu His Leu Ala Glu Leu Leu Thr 
1495 1500 1505 

CCG CCC GCC GGT CCC GCC GCC GGT CCC CGG GAG GAG GAG GCG CGG GCC 
Pro Pro Ala Gly Pro Ala Ala Gly Pro Arg Glu Glu Glu Ala Arg Ala 
1510 1515 1520 

GCC CTC GCG CGC GTC CCG CTC GAA CGG CTC AGG GAA GCC GGC CTC CTC 
Ala Leu Ala Arg Val Pro Leu Glu. Arg Leu Arg Glu Ala Gly Leu Leu 
1525 1530 1535 

GAC GCA CTC CTC CGG CTC GCC GCG GAC GAA TCC GGG GCG ACA ACC CCC 
Asp Ala Leu Leu Arg Leu Ala Ala Asp Glu Ser Gly Ala Thr Thr Pro 
1540 1545 1550 

CGT ACG TCT GCC GCG TCC GGC GCA CCC CGC GGC CGG GAG GAG CCG GAC 
Arg Thr Ser Ala Ala Ser Gly Ala Pro Arg Gly Arg Glu Glu Pro Asp 
1555 1560 i5 6 5 15 ^ Q 

GGC CGC GGC GAG CCG GAC GGC TCG GGA CAC CGC GAA AGC CCG GAC GCG 
Gly Arg Gly Glu Pro Asp Gly Ser Gly His Arg Glu Ser Pro Asp Ala 
1575 1580 1585 

GCC GGC GGG TCG GAC GCC CTC GAC GAT CTC GAC GGG GAC GCC CTC GTC 
Ala Gly Gly Ser Asp Ala Leu Asp Asp Leu Asp Gly Asp Ala Leu Val 
lS90 1595 i 6 oo 

CGG CTC GCC CTC GGG GAA CCG GGC GAG TCA CCGGCCGGCG GAGCACACCC 
Arg Leu Ala Leu Gly Glu Pro Gly Glu * 
1605 1610 

GGCCGTCTCC GGCCCGGCCG CGGCCGGGCC GG AAGCCATC CGCCGCCCAC CCGGTACCGA 

CCCCTCAAGC CCTTCAAGCC CTTCGACCCG TCCGATCAGT CAGTCCGGCG GTCCTCCACG 

ACCGGTCCGG AATCGCCCCC ACACGAGTCA GGAAGCACAC C ATC GCC ATC TCC 

Met Ala Met Ser 
1 

GCC GAG AGG CTC ACG GAG GCG CTC CGG ACC TCG CTC AAG GAG GCC GAG 
Ala Glu Arg Leu Thr Glu Ala Leu Arg Thr Ser Leu Lys Glu Ala Glu 
5 1° 15 20 

CGG CTC CGG CGG CAG AAC CGC GAA CTC AGG GCC GCG CGG GAC GCG GCG 
Arg Leu Arg Arg Gin Asn Arg Glu Leu Arg Ala Ala Arg Asp Ala Ala 
25 30 35 

CGG GAG CCG ATC GCC GTC GTC GGC ATC GCC TCC CGC TAC CCG GGC GGT 
Arg Glu Pro lie Ala Val Val Gly Met Ala Cys Arg Tyr Pro Gly Gly 
40 45 50 

^ GGC CCC GAG GAG CTC TCG GAG CTC GTC GCC GGA GGC CGG GAC 

Val Thr Gly Pro Glu Glu Leu Trp Glu Leu Val Ala Gly Gly Arg Asp 
55 60 65 
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GCG ATC GGG CCG TTC CCC GTG GAC CGG GGC TGG GAC GTG GCG TCG GTG 
Ala lie Gly Pro Phe Pro Val Asp Arg Gly Trp Asp Val Ala Ser Val 
70 75 80 



TGC GGG CTC GCG CTG GCG GGC GGG GTG GCG GTG ATG GCC CAT CCG GCG 
Cys Gly Leu Ala Leu Ala Gly Gly Val Ala Val Met Ala Asp Pro Ala 
40 230 235 240 



CAC ACG GTC CTC GCC CTC GTC ACC GGC ACC GCG CTC AAC CAG GAC GGT 
His Thr Val Leu Gly Leu Val Thr Gly Thr Ala Val Asn Gin Asp Gly 
55 295 300 305 



36500 



36740 



TAC GAC CCG GAT CCC GAG TCG AAG GGC ACC ACG TAC TGC CGG GAG GGC 36548 
Tyr Asp Pro Asp Pro Glu Ser Lys Gly Thr Thr Tyr Cys Arg Glu Gly 
85 90 95 100 

GGG TTC CTG GAA GGC GCC GGT GAC TTC GAC GCC GCC TTC TTC GGC ATC 36596 
Gly Phe Leu Glu Gly Ala Gly Asp Phe Asp Ala Ala Phe Phe Gly He 
105 HO H5 

TCG CCG CGC GAG GCC CTG GTG ATG GAC CCG CAG CAG CGG CTG CTG CTG 36644 
Ser Pro Arg Glu Ala Leu Val Met Asp Pro Gin Gin Arg Leu Leu Leu 
120 125 130 

GAG GTG TCC TGG GAG GCG CTG GAA CGC GCG GGC ATC GAC CCG TCC TCG 36692 
Glu Val Ser Trp Glu Ala Leu Glu Arg Ala Gly He Asp Pro Ser Ser 
135 140 145 

20 CTG CGC GGC AGC CGC GGT GGT GTC TAC GTG GGC GCC GCG CAC GGC TCG 

Leu Arg Gly Ser Arg Gly Gly Val Tyr Val Gly Ala Ala His Gly Ser 
I 50 155 160 

TAC GCC TCC GAT CCC CGG CTG GTG CCC GAG GGC TCG GAG GGC TAT CTG 36788 

Ser Asp Pro ^ Val Pro C-1" Gly Ser Glu Gly Tyr Leu 

165 170 175 180 

CTG ACC GGC AGC GCC GAC GCG GTG ATG TCC GGC CGC ATC TCC TAC GCG 46836 
Leu Thr Gly Ser Ala Asp Ala Val Met Ser Gly Arg He Ser Tyr Ala 
185 190 195 

CTC GGT CTC GAA GGA CCG TCC ATG ACG GTG GAG ACG GCC TGC TCC TCC 36884 
Leu Gly Leu Glu Gly Pro Ser Met Thr Val Glu Thr Ala Cys Ser Ser 
200 205 210 

TCG CTG GTC GOG CTG CAT CTG GCG GTA CGG GCG CTG CGG CAC GGC GAG 36932 
Ser Leu Val Ala Leu His Leu Ala Val Arg Ala Leu Arg His Gly Glu 
215 220 225 



36980 



GCG TTC GTG GAG TTC TCC CGG CAG AAG GGG CTG GCC GCC GAC GGC CGC 37028 
Ala Phe Val Glu Phe Ser Arg Gin Lys Gly Leu Ala Ala Asp Gly Arg 
245 250 255 260 

TGC AAG GCG TTC TCG GCC GCC GCC GAC GGC ACC GGC TGG GCC GAG GGC 
Cys Lys Ala Phe Ser Ala Ala Ala Asp Gly Thr Gly Trp Ala Glu Gly 
265 270 275 



37076 



so ^° ^ f TC GTC ^ GAG CGG CTG TCG GAC GCG CGC CGC GCG GGG 37124 

Val Val Leu v *l ^u Glu Arg Leu Ser Asp Ala Arg Arg Ala Gly 
280 285 290 



37172 
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GCC TCC AAC GGG CTG ACC GCG CCC AAC GGC CCA GCC CAG CAA CGC GTC 37220 
Ala Ser Asn Gly Leu Thr Ala Pro. Asn Gly Pro Ala Gin Gin Arg Val 
)10 315 320 

5 tlf £. G?» 5° ! AC «* «* CTS TCC CCG GAG GAC GTC GAC 37268 

lie Ala Glu Ala Leu Ala Asp Ala Gly Leu Ser Pro Glu Asp Val Asp 
32S 330 335 340 

GCG GTC GAG GCG CAC GGC ACC GGC ACC CGG CTC GGC GAC CCC ATC GAG 37316 
Ala Val Glu Ala His Gly Thr Gly Thr Arg Leu Gly Asp Pro lie G^ 
345 350 3 55 

CCC GGG GCG CTG CTC GCC GCC TCC GGA CGG AAC CGT TCC GGC GAC CAC 373 64 

Ala Gly Ala Leu Leu Ala Ala Ser Gly Arg Asn Arg Ser Gly S P Sis 
, s 360 365 370 

CCG CTG TGG CTC GGC TCG CTG AAG TCC AAC ATC GGG CAT GCC CAG GCC 
Pro Leu Trp Leu Gly Ser Leu Lys Ser Asn He Gly His Ala Gin Ala 
375 380 385 

20 ^ C ^° °GT GTC GGC GGC GTC ATC AAG ATG CTC CAG GCG CTG CGG CAC 

Ala Ala Gly Val Gly Gly Val lie Lys Met Leu Gin Ala Leu Arg His 

GGC TTG CTG CCC CGC ACC CTC CAC GCC GAC GAG CCG ACC CCG CAT GCC 

%l Leu Leu Pro *s Leu His Ala Asp Glu Pro Thr Pro His A1 * 

41° 415 420 

GAC TGG AGC TCC GGC CGG GTA CGG CTG CTC ACC TCC GAG GTC CCG TGG 
Asp Trp Ser Ser Gly Arg Val Arg Leu Leu Thr Ser Glu Val Pro Tro 
42 5 430 435 

30 

?* ^ ° ^ CGG CCC CGG CGG ACC GGG GTC TCC GCC TTC GGC GTC 
Gin Arg Thr Gly Arg Pro Arg Arg Thr Gly Val Ser Ala Phe Gly v£ 
440 445 450 

S JS £n ?f u AT GTC CTC GAA GAG CCC GCC GCG CCC 

Gly Gly Thr Asn Ala His Val Val Leu Glu Glu Ala Pro Ala Pro Pro 

455 460 465 

GCG CCG GAA CCG GCC GGG GAG GCC CCC GGC GGC TCC CGC GCC CCA GAA 
Ala Pro Glu Pro Ala Gly Glu Ala Pro Gly Gly Ser Arg Ala aS SJ 

GGG GCG GAA GGG CCC CTC GCC TGG GTC GTC TCC GGA CGC GAC GAG CCG 
Gly Ala Glu Gly Pro Leu Ala Trp Val Val Ser Gly Arg Asp Glu Pro 
490 495 500 



45 



GCC CTC CGG TCC CAG GCC CGG CGG CTC CGC GAC CAC CTC TCC CGC ACC 
Ala Leu Arg Ser Gin Ala Arg Arg Leu Arg Asp His Leu Ser Arg t£ 
5 °5 510 515 



37412 



37460 



37508 



37556 



37604 



37652 



37700 



37748 



37796 



Pro G^ f ° o CG CGT GAC ATC GCC TO TCC GTC GCC GCC ACG CGC 37844 

Pro Gly Ala Arg Pro Arg Asp lie Ala Phe Ser Leu Ala Ala Thr Arg 

525 530 



S IT GAC CAC CGC 000 GTG CTG ATC GGC TCG GAC GGG GCC GAA 

Ala Ala Phe Asp Has Arg Ala Val Leu He Gly Ser Asp Gly Ala Glu 
ss 535 540 545 



37892 
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CTC GCC GCC GCC CTG GAC GCG TTG GCC GAA GGA CGC GAC GGT CCG GOG 37940 
Leu Ala Ala Ala Leu Asp Ala Leu Ala Glu Gly Arg Asp Gly Pro Ala 
550 555 560 

5 GTG GTG CGC GGA GTC CGC GAC CGG GAC GGC AGG ATG GCC TTC CTC TTC 37988 

Val Val Arg Gly Val Arg Asp Arg Asp Gly Arg Met Ala Phe Leu Phe 
565 570 575 580 

ACC GGG CAG GGC AGC CAG CGC GCC GGG ATG GCC CAC GAC CTG CAT GCC 38036 
10 Thr Gly Gin Gly Ser Gin Arg Ala Gly Met Ala His Asp Leu His Ala 

585 590 595 

GCC CAT ACC TTC TTC GCG TCC GCC CTC GAC GAG GTG ACG GAC CGT CTC 38084 

Ala His Thr Phe Phe Ala Ser Ala Leu Asp Glu Val Thr Asp Arg Leu 
600 605 610 

15 

GAC CCG CTG CTC GGC CGG CCG CTC GGC GCG CTG CTG GAC GCC CGA CCC 38132 

Asp Pro Leu Leu Gly Arg Pro Leu Gly Ala Leu Leu Asp Ala Arg Pro 
615 620 625 

20 GGC TCG CCC GAA GCG GCA CTC CTG GAC CGG ACC GAG TAC ACC CAG CCG 38180 

Gly Ser Pro Glu Ala Ala Leu Leu Asp Arg Thr Glu Tyr Thr Gin Pro 
630 635 640 

GCG CTC TTC GCC GTC GAG GTG GCG CTC CAC CGG CTG CTG GAG CAC TGG 38228 
Ala Leu Phe Ala Val Glu Val Ala Leu His Arg Leu. Leu Glu His Trp 
645 650 655 660 

GGG ATG CGC CCC GAC CTG CTG CTG GGG CAC TCG GTG GGC GAA CTG GCG 38276 
Gly Met Arg Pro Asp Leu Leu Leu Gly His Ser Val Gly Glu Leu Ala 
665 670 675 

GCC GCC CAC GTC GCG GGT GTG CTC GAT CTC GAC GAC GCC TGC GCG CTG 38324 
Ala Ala His Val Ala Gly Val Leu Asp Leu Asp Asp Ala Cys Ala Leu 
680 685 690 

GTG GCC GCC CGC GGC AGG CTG ATG CAG CGC CTG CCG CCC GGC GGC GCG 38372 
Val Ala Ala Arg Gly Arg Leu Met Gin Arg Leu Pro Pro Gly Gly Ala 
695 700 705 



25 



30 



35 



ATG GTC TCC GTG CGC GCC GGC GAG GAC GAG GTC CGC GCA CTG CTG GCC 38420 
Met Val Ser Val Arg Ala Gly Glu Asp Glu Val Arg Ala Leu Leu Ala 
40 7 1 0 7 1 5 7 2 0 

GGC CGC GAG GAC GCC GTC TGC GTC GCC GCG GTG AAC GGC CCC CGG TCG 38468 
Gly Arg Glu Asp Ala Val Cys Val Ala Ala Val Asn Gly Pro Arg Ser 
7 25 730 735 740 

45 

GTG GTG ATC TCC GGC GCG GAG GAA GCG GTG GCC GAG GCG GCG GCG CAG 38516 
Val Val He Ser Gly Ala Glu Glu Ala Val Ala Glu Ala Ala Ala Gin 
745 750 755 

CTC GCC GGA CGA GGC CGC CGC ACC AGG CGG CTC CGC GTC GCG CAC GCC 38564 
Leu Ala Gly Arg Gly Arg Arg Thr Arg Arg Leu Arg Val Ala His Ala 
760 765 770 

TTC CAC TCA CCC CTG ATG GAC GGC ATG CTC GCC GGA TTC CGG GAG GTC 38612 
Phe His Ser Pro Leu Met Asp Gly Met Leu Ala Gly Phe Arg Glu Val 
S5 775 780 785 
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GCC GCC GGC CTG CCC TAC CGG GAA CCG GAG CTG ACG GTC GTC TCC ACG 38660 
Ala Ala Gly Leu Arg Tyr Arg Glu Pro Glu Leu Thr Val Val Ser Thr 
790 795 800 

GTC ACG GGG CGG CCC GCC CGC CCC GGT GAA CTC ACC GGC CCC GAC TAC 38708 
Val Thr Gly Arg Pro Ala Arg Pro Gly Glu Leu Thr Gly Pro Asp Tyr 
805 810 815 820 

TGG GTG GCC CAG GTC CGT GAG CCC GTG CGC TTC GCG GAC GCG GTC CGC 38756 
Trp Val Ala Gin Val Arg Glu Pro Val Arg Phe Ala Asp Ala Val Arg 
825 830 835 

ACG GCA CAC CGC CTC GGA GCC CGC ACC TTC CTG GAG ACC GGC CCG GAC 38804 
Thr Ala His Arg Leu Gly Ala Arg Thr Phe Leu Glu Thr Gly Pro Asp 
840 845 850 

GGC GTG CTG TGC GGC ATG GCA GAG GAG TCC CTG GAG GAC GAC ACC GTG 38852 
Gly Val Leu Cys Gly Met Ala Glu Glu Cys Leu Glu Asp Asp Thr Val 
855 860 865 

GCC CTG CTG CCG GCG ATC CAC AAG CCC GGC ACC GCG CCG CAC GGT CCG 38900 
Ala Leu Leu Pro Ala He His Lys Pro Gly Thr Ala Pro His Gly Pro 
870 875 880 

GCG GCT CCC GGC GCG CTG CGG GCG GCC GCC GCC GCG TAC GGC CGG GGC 38948 
Ala Ala Pro Gly Ala Leu Arg Ala Ala Ala Ala Ala Tyr Gly Arg Gly 
885 890 895 900 

GCC CGG GTG GAC TGG GCC GGG ATG CAC GCC GAC GGC CCC GAG GGG CCG 38996 
Ala Arg Val Asp Trp Ala Gly Met His Ala Asp Gly Pro Glu Gly Pro 
905 910 915 

GCC CGC CGC GTC GAA CTG CCC GTC CAC GCC TTC CGG CAC CGC CGC TAC 39044 
Ala Arg Arg Val Glu Leu Pro Val His Ala Phe Arg His Arg Arg Tyr 
920 925 930 

TGG CTC GCC CCG GGC CGC GCG GCG GAC ACC GAC GAC TGG ATG TAC CGG 39092 
Trp Leu Ala Pro Gly Arg Ala Ala Asp Thr Asp Asp Trp Met Tyr Arg 
935 940 945 

ATC GGC TGG GAC CGG CTG CCG GCT GTG ACC GGC GGG GCC CGG ACC GCC 39140 
He Gly Trp Asp Arg Leu Pro Ala Val Thr Gly Gly Ala Arg Thr Ala 
95 ° 955 960 

GGC CGC TGG CTG GTG ATC CAC CCC GAC AGC CCG CGC TGC CGG GAG CTG 39188 
Gly Arg Trp Leu Val lie His Pro Asp Ser Pro Arg Cys Arg Glu Leu 
965 970 975 ~ 980 

TCC GGC CAC GCC GAA CGC GCG CTG CGC GCC GCG GGC GCG AGC CCC GTA 39236 
Ser Gly His Ala Glu Arg Ala Leu Arg Ala Ala Gly Ala Ser Pro Val 
^85 990 995 

CCG CTG CCC GTC GAC GCT CCG GCC GCC GAC CGG GCG TCC TTC GCG GCA 39284 
Pro Leu Pro Val Asp Ala Pro Ala Ala Asp Arg Ala Ser Phe Ala Ala 
1000 1005 1010 

CTG CTC CGC TCC GCC ACC GGA CCT GAC ACA CGA GGT GAC ACA GCC GCG 39332 
Leu Leu Arg Ser Ala Thr Gly Pro Asp Thr Arg Gly Asp Thr Ala Ala 
1015 1020 1025 
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CCC CTC GCC GGT GTC CTG TCG CTC CTG TCC GAG GAG GAT CGG CCC CAT 39380 
Pro Val Ala Gly Val Leu Ser Leu Leu Ser Glu Glu Asp Arg Pro His 
1030 1035 1040 

CGC CAG CAC GCC CCG GTA CCC GCC GGG GTC CTG GCG ACG CTC TCC CTG 39428 
Arg Gin His Ala Pro Val Pro Ala Gly Val Leu Ala Thr Leu Ser Leu 
1045 1050 1055 1060 

ATC CAG GCT ATG GAG GAG GAG GCG GTG GAG GCT CGC GTC TCG TCC GTC 39476 
Met Gin Ala Met Glu Glu Glu Ala Val Glu Ala Arg Val Trp Cys Val 
1065 1070 1075 

TCC CGC GCC GCG GTC GCC GCC GCC GAC CGG GAA CGG CCC GTC GGC GCG 39524 
Ser Arg Ala Ala Val Ala Ala Ala Asp Arg Glu Arg Pro Val Gly Ala 
1080 1085 1090 

GGC GCC GCC CTC TCG GGG CTC GGG CGG GTC GCC GCC CTC GAA CGC CCC 39572 
Gly Ala Ala Leu Trp Gly Leu Gly Arg Val Ala Ala Leu Glu Arg Pro 
1095 1100 H05 

ACC CGG TCG GGC GGT CTC GTC GAC CTC CCC GCC TCG CCC GGT GCG GCG 39620 
Thr Arg Trp Gly Gly Leu Val Asp Leu Pro Ala Ser Pro Gly Ala Ala 
111° 1115 H20 

CAC TCG GCG GCC GCC GTC GAA CGG CTC GCC GGT CCC GAG GAC CAG ATC 39668 
His Trp Ala Ala Ala Val Glu Arg Leu Ala Gly Pro Glu Asp Gin He 
1125 H30 1135 H40 

GCC GTC CGC GCG TCC GGC AGT TCG GGC CGG CGC CTC ACC AGG CTC CCG 39716 
Ala Val Arg Ala Ser Gly Ser Trp Gly Arg Arg Leu Thr Arg Leu Pro 
1145 iiso H55 

CGC GAC GGC GGC GGC CGG ACG GCC GCA CCC GCG TAC CGG CCG CGC GGC 39764 
Arg Asp Gly Gly Gly Arg Thr Ala Ala Pro Ala Tyr Arg Pro Arg Gly 
1160 H65 H70 

ACG GTC CTC GTC ACC GGT GGC ACC GGC GCG CTC GGC GGG CAT CTC GCC 39812 
Thr Val Leu Val Thr Gly Gly Thr Gly Ala Leu Gly Gly His Leu Ala 
1175 H80 HQs 

CGC TGG CTC GCC GCG GCG GGC GCC GAA CAC CTC GCG CTC ACC AGC CGC 39860 
Arg Trp Leu Ala Ala Ala Gly Ala Glu His Leu Ala Leu Thr Ser Ara 
H90 1195 1200 

CGG GGC CCG GAC GCG CCC GGC GCC GCC GGA CTC GAG GCC GAA CTC CTC 39908 
Arg Gly Pro Asp Ala Pro Gly Ala Ala Gly Leu Glu Ala Glu Leu Leu 
1205 1210 1215 i 22 o 

CTC CTC GGC GCC AAG GTC ACG TTC GCC GCC TCC GAC ACC GCC GAC CGC 39956 
Leu Leu Gly Ala Lys Val Thr Phe Ala Ala Cys Asp Thr Ala Asp Arg 
1225 1230 1235 

GAC GGC CTC GCC CGG GTC CTC CGG GCG ATA CCG GAG GAC ACC CCG CTC 40004 
Asp Gly Leu Ala Arg Val Leu Arg Ala He Pro Glu Asp Thr Pro Leu 
1240 1245 1250 

ACC GCG GTC TTC CAC GCC GCG GGC GTA CCG CAG GTC ACG CCG CTC TCC 40052 
Thr Ala Val Phe His Ala Ala Gly Val Pro Gin Val Thr Pro Leu Ser 
1255 1260 1265 
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CGT ACC TCG CCC GAG CAC TTC GCC GAC GTG TAC GCG GGC AAG GCG GCG 

9 ^L Ser Pro Glu His Phe Ala Asp val Tyr Ala Gly Lys Ala Ala 
1270 1275 1280 

GGC GCC GCG CAC CTG GAC GAA CTG ACC CGC GAA CTC GGC GCC GGA CTC 
?;L • Ala Hls Leu As P Glu L «« Thr Arg Glu Leu Gly Ala Gly Leu 
1285 1290 1295 * 1300 

GAC GCG TTC GTC CTC TAC TCC TCC GGC GCC GGC GTC TGG GGC AGC GCC 
Asp Ala Phe Val Leu Tyr Ser Ser Gly Ala Gly Val Trp Gly Ser Ala 
"OS 1310 * 1315 

GGC CAG GGT GCC TAC GCC GCC GCC AAC GCC GCC CTG GAC GCG CTC GCC 
Gly Gin Gly Ala Tyr Ala Ala Ala Asn Ala Ala Leu Asp Ala Leu Ala 
1320 1325 1330 

CCG CGC CGT GCG GCG GAC GGA CTC CCC GCC ACC TCC ATC GCC TGG GGC 
Arg Arg Arg Ala Ala Asp Gly Leu Pro Ala Thr Ser lie Ala Trp Gly 
1335 1340 1345 

If 0 ^ GGC GGC GGC GGT ATC GGG GCC GAC GAG GCG GGC GCG GAG TAT 
Val Trp Gly Gly Gly Gly Met Gly Ala Asp Glu Ala Gly Ala Glu Tyr 
ij:>u 1355 1360 

CTG GGC CGG CGC GGT ATG CGC CCC ATG GCA CCG GTC TCC GCG CTC CGG 
Leu Gly Arg Arg Gly Met Arg Pro Met Ala Pro Val Ser Ala l2 2J 
55 1370 1375 nso 

GCG ATG GCC ACC GCC ATC GCC TCC GGG GAA CCC TGC CCC ACC GTC ACC 
Ala Met Ala Thr Ala He Ala Ser Gly Glu Pro Pro £r S£ J£ 
13 «5 i39 0 1395 

Hit £ f C 1°° ° AG CGC GAG GGC TTC ACC GCC TTC CGG CCC 

His Thr Asp Trp Glu Arg Phe Gly Glu Gly Phe Thr Ala Phe Arg Pro 
1400 i4os 1410 

AGC CCT CTG ATC GCG GGG CTC GGC ACG CCG GGC GGC GGC CGG GCG GCG 
Ser Pro Leu lie Ala Gly L.u Gly Thr Pro Gly Gly Gly SJ S Sa 
i41b 1420 1425 

^f 5 $1° n CC GAG GAG 000 ** C 000 ACC GCT CCG GCG GAC CTC ACC GCC 
Glu Thr Pro Glu Glu Gly Asn Ala Thr Ala Ala Ala Asp Leu Jhr Sa 
JU 1435 1440 

CTG CCG CCC GCC GAA CTC CGC ACC GCG CTG CGC GAG CTG GTG CGA GCC 
Lju Pro Pro Ala Glu Leu Arg Thr Ala Leu Arg Glu Su SJ £g £a 

1455 1460 

S S Sa S S ?° ° Afc CCG «* GAG GTC <** «« 

Arg Thr Ala Ala Ala Leu Gly Leu Asp Asp Pro Ala Glu Val Ala Glu 

146S 1470 1475 

GGC GAA CGG TTC CCC GCC ATG GGC TTC GAC TCC CTG GCC ACC GTA CGG 
Gly Glu Arg Phe^Pro Ala Met Gly PJ^Asp Ser Leu S i£ J2 

CTC CGC CGC GGA CTC GCC TCG GCC ACG GGC CTC GAC CTG CCC CCC GAT 
Leu Arg Arg Gly Leu Ala Ser Ala Thr Gly Leu Asp Leu Pro Pro Asp 
A * y:> 1500 1505 
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CTG CTC TTC GAC CGG GAC ACC CCG GCC GCG CTC GCC GCC CAC CTC GCC 
Leu Leu Phe Asp Arg Asp Thr Pro Ala Ala Leu Ala Ala His Leu Ala 
1510 1515 1520 

GAA CTG CTC GCC ACC GCA CGG GAC CAC GGA CCC GGC GGC CCC GGG ACC 
Glu Leu Leu Ala Thr Ala Arg Asp His Gly Pro Gly Gly Pro Gly Thr 
i525 1530 1535 1540 

GGT GCC GCG CCG GCC GAT GCC GGA AGC GGC CTC CCG GCC CTC TAC CGG 
Gly Ala Ala Pro Ala Asp Ala Gly Ser Gly Leu Pro Ala Leu Tyr Arg 
1545 1550 1555 

GAG GCC GTC CGC ACC GGC CGG GCC GCG GAA ATC GCC GAA CTC CTC GCC 
Glu Ala Val Arg Thr Gly Arg Ala Ala Glu Met Ala Glu Leu Leu Ala 
1560 1565 1570 

GCC GCT TCC CGG TTC CGC CCC GCC TTC GGG ACG GCG GAC CGG CAG CCG 
Ala Ala Ser Arg Phe Arg Pro Ala Phe Gly Thr Ala Asp Arg Gin Pro 
1575 1580 1585 

GTC GCC CTC GTC CCG CTC GCC GAC GGC GCG GAG GAC ACC GGG CTC CCG 
Val Ala Leu Val Pro Leu Ala Asp Gly Ala Glu Asp Thr Gly Leu Pro 
1590 1595 1600 

CTC CTC GTC GGC TCC GCC GGG ACG GCG GTC GCC TCC GGC CCG GTC GAG 
Leu Leu Val Gly Cys Ala Gly Thr Ala Val Ala Ser Gly Pro Val Glu 
1605 1610 1615 * 1620 

TTC ACC GCC TTC GCC GGA GCG CTC GCG GAC CTC CCG GCG GCG GCC CCG 
Phe Thr Ala Phe Ala Gly Ala Leu Ala Asp Leu Pro Ala Ala Ala Pro 
1625 1630 1635 

ATC GCC GCG CTC CCG CAG CCC GGC TTT CTG CCG GGA GAA CGA GTC CCG 
Met Ala Ala Leu Pro Gin Pro Gly Phe Leu Pro Gly Glu Arg Val Pro 
1640 1645 1650 

GCC ACC CCG GAG GCA TTC TTC GAG GCC CAG GCG GAA GCG CTC CTC CGC 
Ala Thr Pro Glu Ala Leu Phe Glu Ala Gin Ala Glu Ala Leu Leu Arg 
1655 1660 1665 

TAC GCG GCC GGC CGG CCC TTC GTC CTC CTC GGG CAC TCC GCC GGC GCC 

^ r ™ Ala Gly Arg Pr ° Phe Val Leu Leu °ly His Ser Ala Gly Ala 
1670 1675 1680 

AAC ATG GCC CAC GCC CTC ACC CGT CAT CTC GAG GCG AAC GGT GGC GGC 
Asn Met Ala His Ala Leu Thr Arg His Leu Glu Ala Asn Gly Gly Glv 
1685 1690 1695 Y i7^ 0 

CCC GCA GGG CTC GTC CTC ATC GAC ATC TAC ACC CCC GCC GAC CCC GGC 
Pro Ala Gly Leu Val Leu Met Asp lie Tyr Thr Pro Ala Asp Pro Glv 
1705 1710 1715 

GCG ATC GGC GTC TGG CGG AAC GAC ATC TTC CAG TCG GTC TGG CGG CGC 
Ala Met Gly Val Trp Arg Asn Asp Met Phe Gin Trp Val Trp Arg Arq 
1720 1725 1730 

TCG GAC ATC CCC CCG GAC GAC CAC CGC CTC ACG GCC ATC GGC GCC TAC 
Ser Asp He Pro Pro Asp Asp His Arg Leu Thr Ala Met Gly Ala Tyr 
1735 1740 1745 
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CAC CGG CTG CTT CTC GAG TGG TCG CCC ACC CCC GTC CGC GCC CCC GTA 
His Arg Leu Leu Leu Asp Trp Ser Pro Thr Pro Val Arg Ala Pro Val 
1750 17S5 1760 

CTC CAT CTC CGC GCC GCG GAA CCC ATC GGC GAC TCG CCA CCC GGG GAC 
Leu_His Leu Arg Ala Ala Glu Pro Met Gly Asp Trp Pro Pro Gly Asp 
1770 1775 1780 

is ss s «s r c « Ac too gac 000 000 cac acc acc « atc 

Thr Gly Trp Gin Ser His Trp Asp Gly Ala His Thr Thr Ala Gly He 
1785 1790 1795 

CCC GGA AAC CAC TTC ACG ATG ATC ACC GAA CAC GCC TCC GCC GCC GCC 
Pro Gly Asn His Phe Thr Met Met Thr Glu His Ala Ser Ala Ala Ala 
1800 1805 i8io 

SSS^^r 30 ^^^^^^^^ CAG GGC 
Arg Leu Val His Gly Trp Leu Ala Glu Arg Thr Pro Ser Gly Gin Gly 

1815 1820 i 8 25 

GGG TCA CCG TCC CGC GCG GCG GGG AGA GAG GAG AGG CCG TCA 
Gly Ser Pro Ser Arg Ala Ala Gly Arg Glu Glu Arg Pro • 
1830 1835 1840 



ACACGGCAGC CGGCCCGACC GGCACCGCCG CCGGCGGCAC CACCGCCCCG GCGGCGGCAC 
ACGACCTCTC CCGCGCCGGA CGCAGGCTCC AACTCACCCG GGCCGCACAG TCGTTCGCCG 
GCAACCAGGG AGACCCCTAC GGGATCATCC TGCGCGCCGG CACCGCCGAC CCGGCACCGT 
ACGAGGAAGA GATCCGTGAG CGGGGGCCGC TGTTCCACAG CGAACTCCTC GGCGCCTGGG 
TGACCGGCAG CCGCCATGTC GCCGACGCCG TCACGGCCGA CGACGCGTTC GGCGCCCTCA 
CCGCGGACGG TCCACGGCCA GGAGTCCGCG AACTGCCGCT CTCCGGCAGC GCCCTCGACG 
CCGCCCACGG GAACCCCGGC GGCCCGCCCC TCCCCGGAGG GTCGCCGCAC CGGCCCCCGG 
ACAGGGAGGA GCGAGACGAC CCGGACCGGC ACGCGGCGGA CCTCCTCAAC GCCGCCGGCC 
CGGGGCAGGT CCTCGACCTC GTCCCGTTCG CCCGGCGGCT GGCCGCCCGG ACGGCCGGCG 
CGTGGCTGGG CGTCCCGGCG GAACGGCTGC CGCGCTTCGA GACGGCACTC ACCGGCTCCC 
GCCGCGCCCT CGACGCCCTC CTCTCCCCCC AGCTCCTCGC CGACGCGCGG GCCGGACTCG 
CCGCCGAGGA GCCCCTGCGC GCCGTCCTCG GCGAGACCCC GGAGGCACGC GGACGTCCGC 
CCGGCGCGGT CGAGGCGGCC CGCGCGCACG CCGTCAGCGC GGCGGAGCCC ATCGCCGTCC 
TCCTCTGCAA CGCGGTGCGG GAACTCATCG AACGGCCGGC CCAGTCGCGG GCGCTCACCG 
CCGACCCCGG CCTGGCGGGC GCCGCGATCA CCGAAACACT GCTCTGGGCA CCGCCGGTCC 
GCCTGGAGAG CAGGGTGGCA CGCGAGACGG CCGTACTCGC CGGGCGGACG CTCCCCGCTC 
GAACCCATCT CGTCGTCCTC GCCGCCGCCG CCAACCGCGA CGCCTGCCGG AACGCCGGTC 
CGGCCGTCAC CGGCTTCGAC GTCCTCCGCC GCGCCTCGGA CGGCGGCCCC CAGCCCCACG 
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Ser His Phe Leu Gly Leu Arg Gly Pro Ser Leu Val Veil Asp Ser Ala 
180 - - — 



185 



190 



Gin Ser Ala Ser Leu Val Ala Val Gin Leu Ala Cys Glu Ser Leu Arg 
195 200 205 

Arg Gly Glu Thr Ser Leu Ala Val Ala Gly Gly Val Asn Leu lie Leu 
210 215 220 

Thr Glu Glu Ser Thr Thr Val Met Glu Arg Met Gly Ala Leu Ser Pro 
225 230 235 240 

Asp Gly Arg Cys His Thr Phe Asp Ala Arg Ala Asn Gly Tyr Val Arg 
245 250 255 

Gly Glu Gly Gly Gly Ala Val Val Leu Lys Pro Leu Asp Ala Ala Leu 
260 265 270 

Ala Asp Gly Asp Arg Val Tyr Cys Val lie Lys Gly Gly Ala Val Asn 
<*' 5 280 285 

Asn Asp Gly Gly Gly Ala Ser Leu Thr Thr Pro Asp Arg Glu Ala Gin 



295 



300 



Glu Ala Val Leu Arg Gin Ala Tyr Arg Arg Ala Gly Val Ser Thr Gly 



315 



320 



Ala Val Arg Tyr Val Glu Leu His Gly Thr Gly Thr Arg Ala Gly Asp 



330 



335 



Pro Val Glu Ala Ala Ala Leu Gly Ala Val Leu Gly Ala Gly Ala Asp 
340 345 35 0 

Ser Gly Arg Ser Thr Pro Leu Ala Val Gly Ser Val Lys Thr Asn Val 



360 



365 



Gly His Leu Glu Gly Ala Ala Gly lie Val Gly Leu lie Lys Ala Thr 



375 



380 



Leu cys Val Arg Lys Gly Glu Leu Val Pro Ser Leu Asn Phe 



390 



395 



Ser Thr 
400 



Pro Asn Pro Asp lie Pro Leu Asp Asp Leu Arg Leu Arg Val Gin Thr 
405 410 



415 



Glu Arg Gin Glu Trp Asn Glu Glu Asp Asp Arg Pro Arg Val Ala Gly 
a20 425 43 0 

Val Ser Ser Phe Gly Met Gly Gly Thr Asn Val His Leu Val He Ala 

440 445 

Glu Ala Pro Ala Ala Ala Gly Ser Ser Gly Ala Gly Gly Ser Gly Ala 

455 460 

Gly Ser Gly Ala Gly lie Ser Ala Val Ser Gly Val Val Pro Val Val 
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Val Ser Gly Arg Ser Arg Val Val Val Arg Glu Ala Ala Gly Arg 
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Ala Glu Val Val Glu Ala Gly Gly Val Gly Leu Ala Asp Val Ala Val 
500 505 510 

Thr Met Ala Asp Arg Ser Arg Phe Gly Tyr Arg Ala Val Val Leu Ala 
515 520 525 

Arg Gly Glu Ala Glu Leu Ala Gly Arg Leu Arg Ala Leu Ala Gly Gly 
530 535 540 

Asp Pro Asp Ala Gly Val Val Thr Gly Ala Val Leu Asp Gly Gly Val 
545 550 555 * 560 

Val Val Gly Ala Ala Pro Gly Gly Ala Gly Ala Ala Gly Gly Ala Gly 
565 570 575 

Ala Ala Gly Gly Ala Gly Gly Gly Gly Val Val Leu Val Phe Pro Gly 
580 585 590 

Gin Gly Thr Gin Trp Val Gly Met Gly Ala Gly Leu Leu Gly Ser Ser 
595 600 60S 

Glu Val Phe Ala Ala Ser Met Arg Glu Cys Ala Arg Ala Leu Ser Val 
61 0 615 620 

His Val Gly Trp Asp Leu Leu Glu Val Val Ser Gly Gly Ala Gly Leu 
625 "0 635 640 

Glu Arg Val Asp Val Val Gin Pro Val Thr Trp Ala Val Met Val Ser 
6 *5 650 655 

Leu Ala Arg Tyr Trp Gin Ala Met Gly Val Asp Val Ala Ala Val Val 
6 60 665 670 

Gly His Ser Gin Gly Glu He Ala Ala Ala Thr Val Ala Gly Ala Leu 
675 680 685 

Ser Leu Glu Asp Ala Ala Ala Val Val Ala Leu Arg Ala Gly Leu He 
690 695 700 

Gly Arg Tyr Leu Ala Gly Arg Gly Ala Met Ala Ala Val Pro Leu Pro 
705 710 715 720 

Ala Gly Glu Val Glu Ala Gly Leu Ala Lys Trp Pro Gly Val Glu Val 
725 730 735 

Ala Ala Val Asn Gly Pro Ala Ser Thr Val Val Ser Gly Asp Arg Arg 
740 745 750 

Ala Val Ala Gly Tyr Val Ala Val Cys Gin Ala Glu Gly Val Gin Ala 
755 760 765 

Arg Leu He Pro Val Asp Tyr Ala Ser His Ser Arg His Val Glu Asp 
770 775 780 

Leu Lys Gly Glu Leu Glu Arg Val Leu Ser Gly He Arg Pro Arg Ser 
785 790 795 " 800 

Pro Arg Val Pro Val Cys Ser Thr Val Ala Gly Glu Gin Pro Gly Glu 
805 810 815 
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Pro Val Phe Asp Ala Gly Tyr Trp Phe Arg Asn Leu Arg Asn Arg Val 
820 825 830 

Glu Phe Ser Ala Val Val Gly Gly Leu Leu Glu Glu Gly His Arg Arg 
835 840 845 

Phe He Glu Val Ser Ala His Pro Val Leu Val His Ala He Glu Gin 
850 855 860 

Thr Ala Glu Ala Ala Asp Arg Ser Val His Ala Thr Gly Thr Leu Arg 
865 870 875 880 

Arg Gin Asp Asp Ser Pro His Arg Leu Leu Thr Ser Thr Ala Glu Ala 
885 890 895 

Trp Ala His Gly Ala Thr Leu Thr Trp Asp Pro Ala Leu Pro Pro Gly 
900 905 910 

His Leu Thr Thr Leu Pro Thr Tyr Pro Phe Asn His His His Tyr Trp 
915 920 925 

Leu Asp Thr He Asp Gly Gly Gly Gly Asp Asp Ala Thr Gin Glu Lys 
930 935 940 

Glu Ser Gly Pro Leu Thr Arg Glu Leu Arg Gly Leu Pro Ser Ser Gin 
945 950 955 960 

Lys Gin Leu Gly Phe Leu Leu Asp Leu Val Cys Arg His Thr Ala Val 
965 970 " 975 

Val Leu Gly Leu Asp Thr Ala Ala Glu Val Asp Pro Asp Leu Ser Phe 
980 985 990 

Lys Lys Gin Gly He Gin Ser Met Thr Gly Val Glu Leu Arg Asn Arg 
995 1000 1005 

Leu Leu Thr Glu Thr Gly Leu Ala Leu Pro Thr Thr Leu Val Tyr Asp 
1010 1015 1020 

Arg Pro Thr Pro Arg Ala Leu Ala Gin Phe Leu His Thr Glu Leu Leu 
1025 1030 1035 1040 

Asp Gly Ser Pro Ser Gly Ser Val Leu Ala Pro Ala Gin Lys Ser Phe 
1045 1050 1055 

Glu Ala Gin Glu Pro lie Ala Val Val Gly Met Gly Cys Arg Phe Pro 
1060 1065 1070 

Gly Gly Val Gly Ser Pro Glu Ala Leu Trp Arg Leu Val Val Glu Gly 
1075 1080 1085 

Val Asp Ala Val Ser Pro Phe Pro Gly Asp Arg Gly Trp Asp Val Glu 
1090 1095 HOO 

Gly Leu Tyr Asp Pro Glu Pro Gly Val Ala Gly Lys Ser Tyr Val Arg 
1105 1110 1115 H20 

Glu Gly Gly Phe Leu His Asp Ala Ala Glu Phe Asp Ala Glu Phe Phe 
H25 H30 H35 
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Gly He Ser Pro Arg Glu Ala Val Ala Met Asp Pro Gin Gin Arg Leu 
1140 1145 ii 5 o 

Leu Leu Glu Thr Ser Trp Glu Ala He Glu Arg Ala Gly He Asp Pro 
1155 1160 H65 

His Ser Leu His Gly Ser Arg Thr Gly Val Tyr Ala Gly Val Met Pro 
1170 1175 H80 

Gin Glu Tyr Gly Pro Arg Leu Ala Glu Gly Ala Glu Gly Ser Asp Gly 
1185 "90 n95 1200 

Tyr Leu Leu Thr Cly Thr Ser Gly Ser Val Val Ser Gly Arg Val Ala 
12 °5 1210 1215 

Tyr Thr Leu Gly Leu Glu Gly Pro Ala Val Thr Val Asp Thr Ala Cys 
1220 1225 1230 

Ser Ser Ser Leu Val Ala Leu His Leu Ala Val Gin Ala Leu Arg 



1240 1245 



Gly 



Gly Glu Cys Asp Met Ala Leu Ala Gly Gly Val Thr Val Met Ala Gly 
LZil) 1255 1260 

Pro Gly Met Phe Val Glu Phe Ser Arg Gin Arg Gly Leu Ala Ala Asp 
25 1265 1270 1275 1280 

Gly Arg Cy S Lys Ala Phe Ala Asp Gly Ala Asp Gly Thr Ala Trp Ala 
2285 1290 1295 

Glu Gly Ala Gly Val Val Leu Val Glu Arg Leu Ser Asp Ala Arg Arg 
30 1300 1305 131Q 

Leu Gly His Pro Val Leu Ala Val Val Cys Gly Ser Ala Val Asn Gin 
1315 1320 1325 

35 Asp i3*, Ala Ser Asn G1 y ^ 5 Thr Ala Pro Ser Jjj Pto Ser Gln Glu 

Arg^Val He Arg Gin Ala Leu Gly Asn Ala Arg Leu Thr Val Ala Asp 

1350 1355 1360 

Val Asp Val Val Glu Ala His Gly Thr Gly Thr Arg Leu Gly Asp Pro 
i365 1370 1375 

He Glu Ala Gin Ala Leu Leu Gly Thr Tyr Gly Arg Asp Arg Asp Gly 
1380 1385 1390 

Gly Arg Pro Val Trp Leu Gly Ser Leu Lys Ser Asn He Gly His Ala 
1:395 1400 1405 

Gin Ala Ala Ala Gly Val Ala Gly Val He Lys Met Val Leu Ala Met 
lqlv 1415 1420 

Arg^Tyr Gly Trp Leu Pro Arg Thr Leu His Val Asp Glu Pro Ser Arg 

1430 1435 14 4 0 

His Val Asp Trp Ser Ala Gly Gly Val Trp Leu Leu Thr Glu Ala Arg 
55 1445 1450 1455 
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Glu Trp Pro Gly Val Asp Arg Pro Arg Arg Ala Ala Val Ser Ala Phe 
14fi 0 1465 1470 

Gly Val Ser Gly Thr Asn Ala His Leu lie Leu Glu Ala Pro Asp Thr 
147 5 1480 1485 

Ala Glu Ala Glu Ser Ala Thr Thr Pro Val Arg Ser Glu Val Ser Glu 
1490 1495 1500 

Ser Ala Ala Val Leu Asp Ala Arg Ser Gly Val Val Pro Val Val Val 
1505. 1510 1515 1520 

Ser Gly Arg Ser Arg Val Val Val Arg Glu Ala Ala Gly Arg Leu Ala 
1525 1530 1535 

Glu Val Val Glu Ala Gly Gly Val Gly Leu Ala Asp Val Ala Val Thr 
1540 1545 1550 

Met Ala Gly Arg Ser Arg Phe Gly Tyr Arg Ala Val Val Leu Ala Arg 
1555 1560 15 6 5 a 

Gly ?^/ la G1U LSU A1 * Gly ** g Leu ^9 Ala Leu Ala Gly Gly Asp 
1570 1575 1580 

Pro Asp Ala Gly Val Val Thr Gly Ala Val Val Asp Pro Glu Thr Gly 
1585 "SO 1595 1600 

Ser Gly Gly Gly Gly Val Val Leu Val Phe Pro Gly Gin Gly Thr Gin 
1 6 °5 1610 1615 

Trp Val Gly Met Gly Ala Gly Leu Leu Gly Ser Ser Glu Val Phe Ala 
30 1620 1625 1630 

Ala Ser Met Arg Glu Cys Ala Arg Ala Leu Ser Val His Val Glu Trp 
1635 1640 1645 

Asp Leu L eu Glu Val Val Ser Gly Qly ^ ^ 
1650 1655 1660 

Val Val Gin Pro Val Thr Trp Ala Val Met Val Ser Leu Ala Arg Tyr 
1665 1670 1675 1680 

Trp Gin Ala Met Gly Val Asp Val Ala Ala Val Val Gly Hia Ser Gin 
I 685 1690 1695 

Gly Glu lie Ala Ala Ala Thr Val Ala Gly Ala Leu Ser Leu Glu Asp 
I 700 1705 1710 

Ala Ala Ala Val Val Ala Leu Arg Ala Gly Leu lie Gly Arg Tyr Leu 
1? 15 1720 1725 

Ala ?}L Arg Gly Ala Met Ala Ala Val Pro Leu Pro Ala Gly Glu Val 
I 730 1735 1740 

Glu Ala Gly Leu Ala Lys Trp Pro Gly Val Glu Val Ala Ala Val Asn 
1 11SC 1755 1760 

Gly Pro Ala Ser Thr Val Val Ser Gly Asp Arg Arg Ala Val Ala Gly 
SS 1765 1770 1775 
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Tyr Val Ala Val Cys Gin Ala Glu Gly Val Gin Ala Arg Leu He Pro 
1780 1785 1790 

Val Asp Tyr Ala Ser His Ser Arg His Val Glu Asp Leu Lys Gly Glu 
1795 1800 1805 

Leu Glu Arg Val Leu Ser Gly He Arg Pro Arg Ser Pro Arg Val Pro 
1810 1815 1820 

Val Cys Ser Thr Val Ala Gly Glu Gin Pro Gly Glu Pro Val Phe Asp 
1825 1830 1835 1840 

Ala Gly Tyr Trp Phe Arg Asn Leu Arg Asn Arg Val Glu Phe Ser Ala 
1845 1850 1855 

Val Val Gly Gly Leu Leu Glu Glu Gly His Arg Arg Phe He Glu Val 
I860 1865 1870 

Ser Ala His Pro Val Leu Val His Ala He Glu Gin Thr Ala Glu Ala 
1875 1880 1885 

Ala Asp Arg Ser Val His Ala Thr Gly Thr Leu Arg Arg Gin Asp Asp 
1890 1895 1900 

Ser Pro His Arg Leu Leu Thr Ser Thr Ala Glu Ala Trp Ala His Gly 
1905 1910 1915 ' 1920 

Ala Thr Leu Thr Trp Asp Pro Ala Leu Pro Pro Gly His Leu Thr Thr* 
1925 1930 1935 

Leu Pro Thr Tyr Pro Phe Asn His His His Tyr Trp Leu Asp Thr Thr 
1940 1945 1950 

Pro Thr Thr Pro Ala Thr Thr Thr Gin Ser Pro Thr Asp Ala Trp Arg 
1955 I960 1965 

Tyr Arg Val Thr Trp Lys Ala Leu Thr Glu Ser Ser Pro Val Arg Pro 
1970 1975 1980 

His Ser He Gly Arg Cys Leu Leu Val Ala Pro Pro Thr Thr Asp Gly 
1985 1990 1995 2000 

Glu Leu Leu Asp Gly Leu Thr Thr Val Leu Ser Glu Arg Gly Ala Ser 
2005 2010 2015 

Val Ala Arg Leu Glu Val Pro He Gly Ala Arg Arg Ala Glu Val Ala 
2020 2025 2030 

Glu Leu Leu Lys Pro Ser Met Glu Ser Ala Gly Glu Glu Asn Thr Thr 
2035 2040 2045 

Val Val Ser Leu Leu Gly Leu Val Pro Ser Thr Asp Ala Val Arg Thr 
2050 2055 2060 

Ser He Ala Leu Leu Gin Ala Val Ser Asp He Gly Val Pro Ala Ala 
2065 2070 2075 2 080 

Arg Val Trp Ala Leu Thr Arg Arg Ala Val Ala Val Val Pro Gly Glu 
2085 2090 2095 
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Thr Pro Gin Asp Ala Gly Ala Gin Leu Trp Gly Phe Gly Arg Val Ala 
2100 2105 2110 

Ala Leu Glu Leu Pro Asp lie Trp Gly Gly Leu He Asp Leu Pro Glu 
2115 2120 2125 

Thr Ala Glu Leu Thr Arg Thr Pro Glu Thr Ser Gin Pro Pro Gin Thr 
2130 2135 2140 

Pro Glu Arg Leu Pro Gin Thr Pro Asn Arg Arg Ala Leu Glu Leu Ala 
2145 2150 2155 2160 

Ala Ala Val Leu Ala Gly Arg Asp Gly Glu Asp Gin Val Ala Val Arg 
2165 2170 2175 

Ala Ser Gly He Tyr Gly Arg Arg Val Ser Arg Ala Ala Ala Ala Gly 
2180 2185 2190 

Ala Ala Ser Trp Gin Pro Ser Gly Thr Val Leu He Thr Gly Gly Met 
2195 2200 2205 

Gly Ala He Gly Arg Arg Leu Ala Arg Arg Leu Ala Ala Glu Gly Ala 
2210 2215 2220 

Glu Arg Leu Val Leu Thr Ser Arg Arg Gly Pro Glu Ala Pro Gly Ala 
2225 2230 2235 2240 

Ala Glu Leu Ala Glu Glu Leu Arg Gly His Gly Cys Glu Val Val His 
2245 2250 2255 

Ala Ala Cys Asp Val Ala Glu Arg Asp Ala Leu Ala Ala Leu Val Thr 
2260 2265 2270 

Ala Tyr Pro Pro Asn Ala Val Phe His Thr Ala Gly He Leu Asp Asp 
2275 2280 2285 

Ala Val He Asp Thr Leu Ser Pro Glu Ser Phe Glu Thr Val Arg Gly 
2290 2295 2300 

Ala Lys Val Cys Gly Ala Glu Leu Leu His Gin Leu Thr Ala Asp He 
2305 2310 2315 2320 

Lys Gly Leu Asp Ala Phe Val Leu Phe Ser Ser Val Thr Gly Thr Trp 
2325 2330 2335 

Gly Asn Ala Gly Gin Gly Ala Tyr Ala Ala Ala Asn Ala Ala Leu Asp 
2340 2345 2350 

Ala Leu Ala Glu Arg Arg Arg Ala Ala Gly Leu Pro Ala Thr Ser Val 
2355 2360 2365 

Ala Trp Gly Leu Trp Gly Gly Gly Gly Met Ala Ala Gly Ala Gly Glu 
2370 2375 2380 

Glu Ser Leu Ser Arg Arg Gly Leu Arg Ala Met Asp Pro Asp Ala Ala 
2385 2390 2395 2400 

Val Asp Ala Leu Leu Gly Ala Met Gly Arg Asn Asp Val Cys Val Thr 
2405 2410 2415 
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Val Val Asp Val Asp Trp Glu Arg Phe Ala Pro Ala Thr Asn Ala He 
2420 2425 2430 

Arg Pro Gly Arg Leu Phe Asp Thr Val Pro Glu Ala Arg Glu Ala Leu 
2435 2440 2445 

Thr Ala Ala Gly Thr Thr Ser Ala Thr Pro Asp Gly Ala Pro Glu Leu 
2450 2455 2460 

Ala Arg Arg Leu Ser Met Leu Asn Glu Thr Glu Arg Leu Arg Lys Leu 
2465 2470 2475 2480 

Val Glu Leu Val Arg Thr Glu Ala Ala Phe Val Leu Arg His Pro Asn 
2485 2490 2495 

Thr Asp Ala He Gly Ala Glu Arg Pro Phe Lys Ser Ala Gly Phe Asp 
2500 2505 2510 

Ser Leu Thr Ser Leu Glu Leu Arg Asn Arg Leu Asn Ala Gly Thr Gly 
2515 2520 2525 

Leu Lys Leu Pro Ala Thr Val He Phe Asp His Pro Ser Pro Thr Ala 
2530 2535 2540 

Leu Ala Arg Leu Leu Leu Asp Arg Leu Thr Gly Ala Gly Ala Pro Ala 
25 « 2550 2555 2560 

Pro Ala Ala Asp Glu Pro Pro Leu Pro Val Ala Val Ala Asp Asp Asp 
2565 2570 2575 

Pro Val Val He Val Gly Met Ala Cys Arg Phe Pro Gly Gly Ala Gly 
2580 2585 2590 

Thr Pro Glu Ala Leu Trp Lys Leu Val Thr Glu Glu Arg Asp Val He 
2595 2600 2605 

Gly Ala Ala Pro Thr Asp Arg Gly Trp Asp Leu Asp Ser Val Tyr Asp 
2610 2615 2620 

Pro Glu Pro Gly Val Ala Gly Lys Thr Tyr Val Arg Glu Gly Gly Phe 
2625 2630 2635 ~ 2640 

Leu His Asp Ala Ala Glu Phe Asp Ala Glu Phe Phe Gly He Ser Pro 
2645 2650 ' 2655 

Arg Glu Ala Val Ala Met Asp Pro Gin Gin Arg Leu Leu Leu Glu Thr 
2660 2665 2670 

Ser Trp Glu Ala lie Glu Arg Ala Gly He Asp Pro His Ser Leu His 
2675 2680 2685 

Gly Ser Arg Thr Gly Val Tyr Val Gly Leu Thr His Gin Glu Tyr Ala 
2690 2695 2700 

Ser Arg Leu His Glu Ala Pro Glu Glu Tyr Glu Gly Tyr Leu Leu Thr 
2705 2710 2715 2720 

Gly Lys Ser Ala Ser Val Val Ser- Gly Arg lie Ser Tyr Thr Leu Gly 
2725 2730 2735 
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Leu Glu Gly Pro Ser Leu Ser lie Asp Thr Ala Cys Ser Ser Ser Leu 
2740 2745 2750 

val Ala Leu His Asn Ala Ala Gin Ala Leu Arg Gly Gly Glu Cys Asp 
2755 2760 2765 

Met A i* ^ Ala G1 * G1 * v *l Thr Val Met Ala Ala Pro Gly Leu Phe 
2770 2775 2780 

Val Glu Phe Ser Arg Gin Arg Gly Leu Ala Ala Asp Gly Arg Cys Lys 
2785 279 ° 2795 2800 

Ala Phe Ala Asp Gly Ala Asp Gly Thr Ala Trp Ala Glu Gly Ala Gly 
2805 2810 2815 

Val Val Leu Val Glu Arg Leu Ser Asp Ala Arg Arg Leu Gly His Pro 
2820 2825 2830 

Val ^ A i* c Val Val G1 Y Ser A1 * v *l Asn Gin Asp Gly Ala Ser 
2835 2840 2845 

Asn Gly Leu Thr Ala Pro Ser Gly Pro Ser Gin Glu Arg Val lie Arg 
2850 2855 2860 

2865 A1 * 2870^ ^ ^ 2875^ ^ ^ V ' 1 

Glu Ala His Gly Thr Gly Thr Arg Leu Gly Asp Pro lie Glu Ala Gin 
2885 2890 2 895 

Ala Leu Leu Gly Thr Tyr Gly Arg Asp Arg Asp Ala Glu Cys Pro Val 
2900 2905 2910 

Trp Leu Gly Ser Leu Lys Ser Asn He Gly His Ala Gin Ala Ala Ala 
2915 2920 2925 

Ztlo kl * o^c Met Val LeU Ala Met Tyr Gly Trp 

<53 - ,u 2935 2940 

Leu Pro Arg Thr Leu His Val Asp Glu Pro Ser Arg His Val Asp Trp 

2950 2955 2960 

Ser Ala Gly Gly Val Arg Leu Leu Thr Glu Ala Arg Glu Trp Pro Gly 

2965 2970 2975 

Val Asp Arg Pro Arg ^ Ala Ala Val Ser Ma phe Gly ^ ^ 
2980 2985 2990 

A ;« HiS ^ 116 Leu Glu Ala Pro Glu Ala Leu Glu Ala Leu 
2995 3000 3005 

Glu Ala Thr Asp Ala Pro Glu Ala Pro Glu Ala Pro Glu Ala Pro Asp 
J01 ° 3015 3020 

Val Thr Asp Val Thr Glu Ala Leu Glu Ala Pro Asp Ala Thr Glu Ala 

3030 3035 3040 

Glu Gly Ala Lys Ala Pro Gly Ser Pro Glu Glu Ala Gin Pro Ala Val 
3045 3050 3055 
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Gly Val Val Pro Val Val Val Ser Gly Arg Ser Arg Val Val Val Arg 
3060 3065 3070 

Glu Ala Ala Gly Arg Leu Ala Glu Val Val Glu Ala Gly Gly Val Gly 
3075 3080 3085 

Leu Ala Asp Val Ala Val Thr Met Ala Gly Arg Ser Arg Phe Gly Tyr 
3090 3095 3100 

Arg Ala Val Val Leu Ala Arg Gly Glu Ala Glu Leu Ala Gly Arg Leu 
3105 3110 3115 3120 

Arg Ala Leu Ala Gly Gly Asp Pro Asp Ala Gly Val Val Thr Gly Ala 
3125 3130 3135 

Val Val Asp Pro Glu Thr Gly Ser Gly Gly Gly Gly Val Val Leu Val 
3140 3145 3150 

Phe Pro Gly Gin Gly Thr Gin Trp Val Gly Met Gly Ala Gly Leu Leu 
3155 3160 3165 

Gly Ser Ser Glu Val Phe Ala Ala Ser Met Arg Glu Cys Ala Arg Ala 
3170 3175 3180 

Leu Ser Val His Val Glu Trp Asp Leu Leu Glu Val Val Ser Gly Glv 
3185 3190 3195 3200 

Ala Gly Leu Glu Arg Val Asp Val Val Gin Pro Val Thr Trp Ala Val 
3205 3210 3215 

Met Val Ser Leu Ala Arg Tyr Trp Gin Ala Met Gly Val Asp Val Ala 
3220 3225 3230 

Ala Val Val Gly His Ser Gin Gly Glu lie Ala Ala Ala Thr Val Ala 
3235 3240 3245 

Gly Ala Leu Ser Leu Glu Asp Ala Ala Ala Val Val Ala Leu Arg Ala 
3250 3255 3260 

Gly Leu He Gly Arg Tyr Leu Ala Gly Arg Gly Ala Met Ala Ala Val 
3265 3270 3275 3280 

Pro Leu Pro Ala Gly Glu Val Glu Ala Gly Leu Ala Lys Trp Pro Gly 
3285 3290 " 3295 

Val Gin Val Ala Ala Val Asn Gly Pro Ala Ser Thr Val Val Ser Glv 
3300 3305 3310 

Asp Arg Arg Ala Val Ala Gly Tyr Val Ala Val Cys Gin Ala Glu Gly 
3315 3320 3325 

Val Gin Ala Arg Leu He Pro Val Asp Tyr Ala Ser His Ser Arg His 
3330 3335 3340 

Val Glu Asp Leu Lys Gly Glu Leu Glu Arg Val Leu Ser Gly He Ara 
3345 33 50 3355 3360 

Pro Arg Ser Pro Arg Val Pro Val Cys Ser Thr Val Ala Gly Glu Gin 
3365 3370 3375 
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Pro Gly Glu Pro Val Phe Asp Ala Gly Tyr Trp Phe Arg Asn Leu Arg 
3380 3385 3390 

Asn Arg Val Glu Phe Ser Ala Val Val Gly Gly Leu Leu Glu Gin Gly 
3395 3400 3405 

His Arg Arg Phe He Glu Val Ser Ala His Pro Val Leu Val His Ala 
3410 3415 3420 

He Glu Gin Thr Ala Glu Ala Ala Asp Arg Ser Val His Ala Thr Gly 
3425 3430 3435 3440 

Thr Leu Arg Arg Gin Asp Asp Ser Pro His Arg Leu Leu Thr Ser Thr 
3445 3450 3455 

Ala Glu Ala Trp Ala His Gly Ala Thr Leu Thr Trp Asp Pro Ala Leu 
3460 3465 3470 

Pro Pro Gly His Leu Thr Thr Leu Pro Thr Tyr Pro Phe Asn His His 
3475 3480 3485 

His Tyr Trp Ala Val Thr Ser Pro Ala Gly Val Gly Asp Ala Ala Ala 
3490 3495 3500 

Gly Arg Phe Gly Met Thr Trp Glu Asp His Pro Phe Leu Arg Gly Gly 
3505 3510 3515 3520 

Leu Pro Leu Ala Asp Ser Gly Glu Arg Val Phe Ala Gly Arg Leu Ala 
3525 3530 3535 

Gly Ser Glu His Asp Trp Leu Thr Asp His Ala Val Ser Gly Val Thr 
3540 3545 3550 

Leu Leu Pro Gly Thr Ala Phe Val Glu Phe Ala Leu His Ala Gly Ala 
3555 3560 3565 

Ala Thr Gly Cys Gly Arg Leu Glu Glu Leu Ser Val Glu Ala Pro Leu 
3570 3575 3580 

Val Leu Pro Ala Ala Gly Gly Val Arg Val Gin Met Arg Val Ser Ala 
3585 3590 3595 3600 

Ala Asp Glu Ser Gly Arg Arg Arg Val Ala He His Ser Ala Pro Glu 
3605 3610 3615 

Ala Ala Val His Ser Ala Ala Glu Gly Gly Asp Ser Ala Gly Val Trp 
3620 3625 3630 

Thr Arg His Gly Glu Gly Thr Leu Val Pro Asp Pro Glu Pro Thr Pro 
3635 3640 3645 

Pro Asp Ala Asp Trp Ala Arg Ala Trp Pro Pro Ala Gly Glu Arg Val 
3650 3655 3660 

Glu Pro Ala Glu Leu Tyr Glu Arg Phe Gly Ala Leu Gly Tyr Glu Tyr 
3665 3670 3675 3680 

Gly Glu Ala Phe Ala Gly Val Arg Ala Val Trp Arg Gin Pro Asp Ala 
3685 3690 3695 
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Leu Leu Ala Glu Val Leu Leu Pro Asp Arg Ala Ser Thr Gly Ala Gly 
3700 3705 3710 

Arg Phe Gly Val His Pro Ala Leu Leu Asp Ala Ala Leu Gin Pro Trp 
3715 3720 3725 

He Ala Gly Gly Leu Leu Glu Val Pro Glu Asp Ala Val Leu Leu Pro 
3730 3735 3740 

Phe Ala Trp Gin Gly Val Ser Leu Tyr Ala Thr Gly Ala Gly Ala Leu 
3745 3750 3755 3760 

Arg Val Arg Leu Thr Lys Ala Gly Asp Gly Ala Val Ser Leu Gin Ala 
3765 3770 3775 

Ala Asp Thr Ser Gly Ala Ala Val Leu Ser Leu Gly Ala Leu Val Met 
3780 3785 3790 

Arg Pro Leu Ala Arg Arg Lys Leu Asp Val Leu Leu Gly Thr Asp Ala 
3795 3800 3805 

Gly Glu Arg Ser Leu Tyr Arg Val Glu Trp Gin Pro Arg Leu Leu Pro 
3810 3815 3820 

Ala Gly Pro Pro Arg Ser Trp Ala Val Leu Gly Pro Asp Ala Asp Arg 
3825 3830 3835 3840 

Leu Ala Gly Thr Pro Gly Leu Gly Asp Gin Pro Asp Gly Gly Pro Thr 
3845 3850 3855 

Ala Leu Tyr Pro Glu Val Arg Ala Leu Arg Lys Ala Leu Ala Ala Gly 
3860 3865 3870 

Ala Pro Arg Pro Glu Ala Val Val Leu Pro Val Leu Ser Gly Ala Gly 
3875 3880 3885 

Ala Thr Pro Glu Ser Val Arg Gin Thr Thr Glu Arg Cys Leu Thr Ala 
3890 3895 3900 

Leu Gin Asp Trp Leu Asp Ala Glu Glu Leu Val Asp Thr Pro Leu He 
3905 3910 3915 3920 

Val Leu Thr Arg Gly Ala Val Ala Ala Val Pro Gly Glu Glu He Gly 
3925 3930 3935 

Asp Leu Ala Cys Ala Gly Val Trp Gly Leu Val Arg Ser Ala Arg Ser 
3940 3945 3950 

Glu His Pro Gly Arg Phe Ala Leu Val Asp Thr Asp Gly His Pro Asp 
3955 3960 3965 

Asp Arg Thr Ala Leu Pro Leu Ala Leu Arg Ala Val Leu Asp Gly Ala 
3970 3975 3980 

Gly Gin Leu Ser Leu Arg Ala Gly Thr Ala Arg Thr Pro Val Leu Leu 
3985 3990 3995 4000 

Arg Ala Gly Thr Pro Glu Glu Gin Arg Gly Pro Ala Phe Asp Pro Ala 
4005 4010 4015 
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Gly Thr Val Leu Val Thr Gly Ala Thr Gly Thr Leu Gly Arg Leu Leu 
4020 4025 4030 

Ala Arg His Leu Ala Ala Glu His Gly Val Arg His Leu Leu Leu Leu 
4035 4040 4045 

Ser Arg Gly Gly Arg Ala Ala Glu Gly Ala Asp Glu Leu Ala Ala Glu 
4050 4055 4060 

Leu Ala Gly Leu Glu Ala Glu Pro Cys Phe Ala Ala Cys Asp Ala Ala 
4065 4070 4075 4080 

Asp Arg Glu Ala Leu Ala Arg Val Leu Ala Glu Val Pro Ala Asp Arg 
4085 4090 4095 

Pro Leu Thr Gly Val He His Ala Ala- Gly Val Leu Asp Asp Gly Thr 
4100 4105 4110 

Leu Asp Ala Leu Thr Pro Glu Arg He Gly Thr Val Met Arg Pro Lys 
4H5 4120 4125 

Ala Asp Ala Ala Leu Asn Leu His Glu Leu Thr Arg Thr Ser Pro Leu 
4130 4135 4140 

Ser Val Phe Ala Val Phe Ser Gly Ala Ala Gly He Leu Gly Arq Pro 
4145 4150 4155 4160 

Gly Gin Ala Asn Tyr Ala Ala Ala Asn Thr Phe Leu Asp Ala Leu Ala 
4165 4170 4175 

Gin His Arg Arg Ala His Gly Leu Pro Ala Val Ser Leu Ala Trp Gly 
4180 4185 4190 

Leu Trp Gly Gly Ala Thr Gly Met Thr Gly His Leu Ser Gly Thr Asp 
4195 4200 4205 

Leu Arg Arg Met Arg Arg Ser Gly He Ala Pro Met Thr His Asp Gin 
4210 4215 4220 

Gly Leu Ala Leu Phe Asp Arg Ala Leu Ala Ala Ser Ala Glu Asp Pro 
4225 4230 4235 42 40 

Leu Leu Val Pro Met Arg Leu Asp Leu Ala Ala Leu Val Arg Glu Arg 
4245 4250 4255 

Ala Glu His Gly Pro Asp Ala Val Pro Gly Pro Leu Leu Gly Leu Leu 
4260 4265 4270 

Pro Ala Arg Ala Ala Val Arg Gin Ala Ala Ala Pro Val Arg Gly Gly 
4275 4280 428 5 

Ala Pro Ala Pro Ala Gly Gly Glu Gly Thr Ala Glu Arg Leu Ala Gly 
4290 4295 4 300 

Leu Gly Glu Glu Ala Arg Leu Arg Glu Leu Val Arg Leu Val Arg Ala 
4305 4310 4315 4320 

Glu' Val Ser Gly Val Leu Gly Tyr Ser Gly Pro Asp Ala Val Glu Pro 
4325 4330 4335 
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Gly Arg Pro Phe Lys Asp Leu Gly Phe Asp Ser Leu Thr Ala Val Glu 
4340 4345 4350 

Leu Arg Asn Arg Leu Gly Ala Ala Thr Gly Leu Arg Leu Pro Thr Ala 
4355 4360 4365 

Leu Val Phe Asp Arg Pro Thr Ser Gin Ala Val Ala Glu Tyr Leu Ala 
4370 4375 4380 

Ala Glu Leu Ala Gly Pro Arg Asp Gly Gly Asp Thr Ala Ala Ala Ala 
4385 4390 4395 4400 

Phe Glu Gly Leu Glu Ala Leu Ala Ala Ala Val Gly Ala Leu Ala Glu 
4405 4410 4415 

Asp Asp Leu Arg Arg Asp Val Leu Arg Arg Arg Leu Thr Glu Leu Ala 
4420 4425 4430 

Ala Ala Leu Thr Pro Gin Gly Arg Asn Pro Ser Ala Pro Ala Pro Ala 
4435 4440 4445 

Pro Ser Asp Leu Asp Glu Arg Leu Asp Ser Ala Asn Asp Asp Asp Leu 
4450 4455 4460 

Phe Ala Phe He Glu Glu Gin Leu * 
4465 4470 



(2) INFORMATION FOR SEQ ID NO: 3; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1865 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

Met Thr Ala Glu Leu Val Ala Thr Arg Lys Arg Leu Gly Ala Leu Glu 
15 10 15 

Glu Arg Ala Arg Glu Pro He Ala Val Val Ala Met Ser Cys Arg Tyr 
20 25 30 

Pro Gly Gly Val Thr Thr Pro Glu Asp Leu Trp Arg Leu Leu Ala Asp 
35 40 45 

Glu Arg Asp Ala Val Ser Gly Leu Pro Arg Asp Arg Gly Trp Asp Leu 
50 55 60 

Asp Ala Leu Tyr Asp Pro Asp Gly Gly Pro Gly Thr Ser Tyr Ala Arg 
65 70 75 80 

Glu Gly Gly Phe Leu Ser His Cys Ala Gly Phe Asp Ala Glu Phe Phe 
85 90 95 

Gly He Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gin Gin Arg Leu 
100 105 HO 
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Leu Leu Glu Thr Ser Trp Glu Ala Leu Glu Arg Ala Gly Val Thr Ala 
115 120 125 

Asp Arg Ala Arg Gly Ser Arg Thr Gly Val Tyr Ala Gly Val Met Tyr 
130 135 140 

Asp Asp Tyr Gly Ala Arg Val Leu Tyr Gly Ala Gly Ala Gly Pro Pro 
145 150 155 160 



w 



Glu Asp Leu Glu Gly Tyr Leu Val Asn Gly Ser Ala Gly Ser He Ala 
165 170 175 



15 



Ser Gly Arg Val Ser Tyr Thr Phe Gly Leu Arg Gly Pro Ala Val Thr 
180 185 190 

Val Asn Thr Ala Cys Ser Ser Ser Leu Val Ser Leu His Leu Ala Val 
195 200 205 



20 



25 



Arg Ala Leu Arg Asn Gly Glu Cys Asp Met Ala Leu Ala Gly Gly Ala 
210 215 220 

Thr Val Leu Ser Thr Pro Thr Val Leu Val Asp Phe Ser Arg Gin Arg 
225 230 235 240 

Gly Leu Ala Pro Asp Gly Arg Cys Lys Ala Phe Ala Asp Ser Ala Asp 
245 250 255 

Gly Thr Ser Trp Ala Glu Gly Ala Gly Met Leu Leu Leu Gin Arg Leu 
260 265 270 



30 



Ser Asp Ala Arg Arg Glu Gly Arg Pro Val Leu Ala Val He Arg Gly 
275 280 285 



Ser Ala Val Asn Gin Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn 
290 295 300 



35 



40 



45 



Gly Arg Ala Gin Arg Gin Val He Glu Asp Ala Leu Arg Asp Ala Gly 
305 310 315 320 

Val Gly Pro Asp Gin Val Asp Ala Val Glu Ala His Gly Thr Gly Thr 
325 330 ' 335 

Glu Leu Gly Asp Pro He Glu Ala Gly Ala Leu Leu Ala Thr Tyr Gly 
340 345 350 

Thr Ala Arg Thr Ala Glu Arg Pro Leu Trp Leu Gly Ser Leu Lys Ser 
355 360 365 

Asn He Gly His Thr Gin Ala Ala Ala Gly Val Ala Gly Val He Lys 
370 375 380 



50 



55 



Met Val Leu Ala Met Arg His Gly Arg Leu Pro Arg Thr Leu His Val 

385 390 395 400 

Asp Arg Pro Thr Thr Arg Val Asp Trp Glu Lys Gly Gly Val Arg Leu 
405 410 ' 415 

Leu Thr Glu Pro Val Pro Trp Pro Gly Glu Ala Gly Glu Pro Arg Arg 
420 425 430 
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Ala Gly Val Ser Ser Phe Gly Ala Ser Gly Thr Asn Ala His Val Val 
435 440 445 

Leu Glu Ser Val Pro Ala Gly Glu Pro Pro Ala Ala Gly Arg Pro Glu 
450 455 460 

Asp Thr Gly Gly Ala Trp Thr Val Ser Gly Arg Gly Pro Ala Ala Leu 
465 470 475 480 

Arg Ala Gin Ala Ala Arg Leu Tyr Asp Ala Leu Thr Gly Thr Gly Thr 
485 490 495 

Gly Thr Gly Gin Gly Ala Gly Gin Gly Ala Gly Pro Gly Thr Ala Glu 
500 505 510 

Val Ala Gly Ala Leu Ala His Ala Arg Thr Ala Phe Arg His Arg Ala 
515 520 525 

Val Val Leu Gly Gly Asn Arg Ala Glu Leu Leu Ala Gly Leu Arg Glu 
530 535 540 

Leu Ala Glu Glu Glu His Pro Gly Pro Arg Val Val Thr Gly Thr Ala 
545. 550 555 " 560 

Pro Ala Thr Glu Arg Arg Thr Ala Phe Leu Phe Ser Gly Gin Gly Ser 
565 570 575 

Gin Arg Ala Gly Ser Gly Arg Gly Leu Tyr Arg Arg His Pro Val Phe 
580 585 590 

Ala Arg Ala Leu Asp Glu Val Cys Ala Ala Leu Glu Pro His Leu His 
595 600 605 

Arg Pro Leu Arg Asp Leu Met Phe Ala Glu Pro Gly Ser Pro Glu Ala 
610 615 620 

Glu Pro Leu Asp Arg Thr Glu Phe Thr Gin Pro Ala Leu Phe Ala Leu 
625 630 635 640 

Gin Thr Ala Leu Phe Arg Leu Ala Glu His His Gly Leu Arg Ala Glu 
645 650 655 

Ala Leu Cys Gly His Ser Val Gly Glu He Ala Ala Ala His Ala Ala 
660 665 670 

Gly Val Leu Thr Leu Pro Asp Ala Ala Arg Leu Val Ala Ala Arg Gly 
675 680 685 

Arg Leu Met Gin Ala Leu Pro Ala Gly Gly Ala Met Ala Ala Leu Arg 
690 695 700 



Ala Thr Ala Glu Glu He Ala Pro Leu Leu Glu Arg Arg Ala Gly Glu 
705 710 715 720 

Leu Ala Leu Ala Ala Val Asn Gly Pro Ser Ser Val Val Val Ser Gly 
725 730 735 

Asp Glu Ala Ala Val Leu Glu Leu Leu Glu Gin Trp Arg Ala Glu Gly 
740 745 750 
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Arg Glu Ala Arg Arg Leu Ala Val Ser His Ala Phe His Ser Pro Arg 
755 760 765 

Met Asp Gly Met Leu Thr Gin Phe Asp Arg Val Ala Arg Thr Leu Thr 
770 775 780 

Phe Ala Pro Pro Thr He Pro Leu Val Ser Thr Leu Thr Gly Thr Pro 
7 « 790 795 800 

Val Thr Glu Glu Thr Leu Cys Thr Ala Asp His Trp Val Arg Gin Ala 
805 810 815 

Arg Glu Pro Val Arg Phe Leu Asp Ala Met Arg Thr Leu Arg Ala Asp 
820 825 830 

Gly He Asp Thr Phe Val Glu Leu Gly Pro Asp Gly Val Leu Ser Ala 
835 840 845 

Met Ala Arg Asp Cys Ala Asp Asp Arg Pro Asp Gly Asp Thr Thr Gly 
850 855 860 

Ala Gly Asp Gly Glu Thr Pro Asp Pro Leu Leu Thr Leu Pro Leu Leu 
865 870 875 880 

Arg Arg Ser Val Pro Glu Thr Gly Asp Ala Glu His Pro Gly Gly Phe 
885 890 895 

Glu Arg Ala Leu Ala Thr Ala Tyr Ala His Gly Val Pro Leu Arg Leu 
900 905 910 

Ala Pro Ala Pro Asp Ala Ala Ser Leu Ala Val Ala Ala Glu Leu Pro 
915 920 925 

Thr Tyr Ala Phe Gin Arg Thr His Tyr Trp Leu Asp Ala Pro Ala Ala 
930 935 940 

Pro Ala Ala Leu Pro Ala Gly Leu Asp Asp Ala Gly His Pro Leu Leu 
945 950 955 960 

Ser Ala Ala Leu Asp Leu Pro Gly Gly Arg Gly Thr Val Trp Thr Gly 
965 970 975 

Ala Leu Ser Ala Ala Thr Leu Pro Trp Ala Ala Asp His Ser Val His 
980 985 * 990 

Gly Arg Thr Val Leu Pro Gly Thr Ala Leu Leu Asp Leu Ala Leu His 
995 1000 1005 

Ala Ala Pro Arg Val Gly Glu Leu Thr Phe Glu Ala Pro Leu Val Leu 
1010 1015 1020 

Pro Glu Asp Gly Glu Val Arg Leu Arg Val Val Leu Ala Glu Pro Asp 
1025 1030 1035 1040 

Ala Ser Gly Val Arg Glu Leu Ser Val His Ser Ala Gly Glu Asp Gly 
1045 1050 1055 

Gly Trp Thr Arg His Ala Thr Ala Val Leu Asp Thr Gly Thr Thr Thr 
1060 1065 1070 
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Ala Gly Glu Pro Ala Gly Ala Pro Pro Ala Ala Trp Pro Pro Gly Asp 
1075 1080 1085 

Ala Glu Pro Leu Asp Leu Ala Ala Glu Tyr Glu Arg Phe Ala Asp Ala 
1090 1095 1100 

Gly He Gly Tyr Gly Pro Ala Phe Arg Gly Leu Arg Ser Ala Trp Arg 
1105 mo H15 1120 

Asp Gly Asp Ala He Leu Ala Asp Val Arg Leu Pro Gly Glu Leu Ala 
1125 1130 1135 

Gly Glu Ala Asp Arg Tyr Gly lie His Pro Ala Leu Leu Asp Ala Ala 
1140 H45 1150 

Leu His Thr Ala Ala Ala Ala Leu Gly Gly Ala His Gly Met Leu Pro 
1155 H60 1165 

Phe Thr Trp Asn Gly Val Thr Leu His Ala Arg Gly Ala His Ala He 
1170 H75 1180 

Arg Val Arg Leu Thr Pro Ala Gly Pro Asp Ala Val Ala Val Thr Ala 
H85 1190 1195 1200 

Val Asp Pro Ala Gly Arg Pro Val Phe Thr Ala Ala Ser Leu Thr Leu 
1205 1210 1215 

Arg Pro Val Thr Thr Gly Gin Leu Thr Ala Ala Glu Ala Ala Arg Ala 
1220 1225 1230 

Pro Leu Tyr Arg Val Arg Trp Thr Gly Leu Pro Asp Thr Gly Thr Ala 
1235 1240 1245 

Arg Asp His Thr Trp Ala Val Ala Gly Gly Pro Gly Asp Leu Leu Pro 
1250 1255 1260 

Gly Glu Thr Pro His His Pro Asp Leu Ala Ser Ala Leu Ala Asp Thr 
1265 1270 1275 1280 

Gly Thr Ala Pro Phe Arg Val Leu Ala Asp Leu Arg Gly Tyr Gly Thr 
1285 1290 ~ 1295 

Ala Thr Pro Arg Glu Leu Ala Ser Gin Ala Leu Ala Leu Val Gin Gin 
1300 1305 1310 

Trp Ala Asp Ala Ala Glu Ala Ala Glu Gly Arg Leu Val Leu Val Thr 
1315 1320 1325 

Arg Arg Ala Val Asp He Gly Asp Gly Val Thr Asp Pro Ala Ala Ala 
1330 1335 1340 

Thr Val Trp Gly Leu Val Arg Ala Ala Gin Ser Glu His Pro Gly Cys 
1345 1350 1355 1360 

Phe Ala Leu Leu Asp Thr Asp Asp Ser Pro Arg Ser Arg Gin Leu Leu 
1365 1370 1375 

Pro Arg Val Ala Gly Thr Ala Glu Gin Leu Ala Leu Arg Asp Gly Thr 
1380 1385 1390 
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Leu Leu Ala Pro Ser Leu Thr Arg Ala Thr Leu Pro Ala Gly Ala Arg 
1395 1400 1405 

Leu Pro Ala Leu Asp Gly Thr Val Leu He Thr Gly Gly Thr Gly Ser 
5 1410 1415 1420 

Leu Gly Ala Glu Ala Ala Arg His Leu Val Thr Arg His Gly Ala Arg 
1425 1430 1435 1440 

w Arg Leu Leu Leu Thr Ser Arg Ser Gly Pro Gin Ala Pro Gly Ala Ala 

1445 1450 1455 

Glu Leu Val Ala Glu Leu Ala Ala Leu Gly Ala His Ala Asp Val Ala 
1460 1465 1470 

15 Ala Cys Asp Val Ala Asp Arg Ala Ala Leu Arg Ala Leu Leu Asp Arg 

1475 1480 1485 

Val Pro Ala Gly His Pro Leu Thr Ala Val Leu His Thr Ala Gly Val 
1490 1495 1500 

Leu Asp Asp Gly Val Leu Thr Ala Gin Thr Pro Gin Arg Leu Ala Ala 
1505 1510 1515 1520 

Val Leu Arg Pro Lys Ala Asp Ala Val Arg Asn Leu His Glu Leu Thr 
1525 1530 1535 

Gin Gly His Ala Leu Ser Ala Phe He Leu Tyr Ser Ser Ala Ala Gly 
1540 1545 1550 

Val Leu Gly Ser Ala Gly Gin Ser Gly Tyr Ala Ala Ala Asn Ala Tyr 
30 1555 1560 1565 

Leu Asp Ser Phe Ala Val Trp Arg Arg Ser Arg Gly Leu Pro Ala Val 
1570 1575 1580 



20 



25 



35 



Ser Leu Gly Trp Gly Pro Trp Asp Gly Gly Gly Met Ala Ser Gly Leu 
1585 1590 1595 1600 

Gly Gly Thr Asp Thr Ala Arg Leu Arg Arg Ser Gly He Ala Pro Leu 
1605 1610 1615 

40 Ser Arg Ala Glu Gly Leu Ala Ala Leu Asp Ala Ala Leu Ala Ala Gly 

1620 1625 1630 

Gly Asp Asp Thr Ala Pro Ala His Leu Leu Pro He Arg Val Asp Ala 
1635 1640 1645 

45 

Val Thr Leu Arg Gly Ala Asp Thr Val Pro Ala Val Leu Arg Asp Leu 
1650 1655 1660 

Ala Gly Thr Ala Pro Ser Ala Ala Glu Arg Pro Pro Gly Thr Pro Glu 
1665 1670 1675 1680 

Asp Thr Asn Ala Pro Leu Ala Asp Val Thr Gin Leu His Gly Arg Glu 
1685 1690 1695 

Arg Lys Glu Ala Leu Thr Gly Phe Val Arg Ala Gin Val Ala Ala Val 
55 1700 1705 1710 



so 
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Leu Gly His Pro Thr Ser Asp Thr lie Asp Val Arg Arg Ser Phe Lys 
1715 1720 1725 

Glu Ala Gly Phe Asp Ser Leu Thr Ala Val Glu Leu Arg Asn Arg Leu 
1730 1735 1740 

Arg Ala Ala Thr Gly Leu Lys Leu Pro Ala Thr Leu Val Phe Asp His 
17^5 1750 1755 1760 

Pro Thr Pro Leu Ala Leu Ala Gly Phe Leu His Arg Glu Leu Pro Gly 
1765 1770 1775 

Ala Glu Ala Ser Leu Met Ser Ala He Asp Thr Leu Arg His Arg Leu 
1780 1785 1790 

Arg Asp Ala Leu Ala Asp Asp Ala Ala Asp Asp Ala Leu Arg Asp Gin 
1795 1800 1805 

He Thr Arg Arg Leu Glu Thr Leu Leu Ala Gly He Ala Arg Thr Glu 
1810 1815 1820 

Glu Pro Ala Pro Ala Thr Ala Ala Ala Asp Asp Gly Ser Gly Ala Gly 
1825 1830 1835 1840 

Asp Val Ala Glu Arg Leu Ser Thr Ala Ser Asp Asp Glu Leu Phe Glu 
1845 1850 1855 

Leu Leu Asp Ser Gly Phe Thr Pro * 

I860 1865 



(2) INFORMATION FOR SEC ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3730 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Ser Thr Glu Asn Ser Thr Asn Val Pro Ala Ser Glu Asp Lys Leu 
15 10 15 

Arg Ala Tyr Leu Arg Arg Ala Met Ala Asp Leu His Glu Ser Arg Glu 
20 25 30 

Arg Leu Arg Ala Thr Glu Ala Arg Ala Gin Glu Pro He Ala Val Val 
35 40 45 

Gly Met Gly Cys Arg Phe Pro Gly Gly Val Gly Ser Pro Glu Ala Leu 
50 55 60 

Trp Arg Leu Val Val Glu Gly Val Asp Ala Val Ser Pro Phe Pro Gly 
65 70 75 80 

Asp Arg Gly Trp Asp Val Glu Gly Leu Tyr Asp Pro Glu Pro Gly Val 
85 90 95 
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Ala Gly Lys Ser Tyr Val Arg Glu Gly Gly Phe Leu His Asp Ala Ala 
100 105 HO 

Glu Phe Asp Ala Glu Phe Phe Gly He Ser Pro Arg Glu Ala Val Ala 
115 120 125 



10 



15 



20 



25 



30 



35 



40 



45 



SO 



55 



Met Asp Pro Gin Gin Arg Leu Leu Leu Glu Thr Ser Trp Glu Ala He 
130 135 140 

Glu Arg Ala Gly He Asp Pro His Ser Leu His Gly Ser Arg Thr Gly 
145 150 155 160 

Val Tyr Ala Gly Val Met Tyr His Asp Tyr Gly Thr Gly Gin Thr Ser 
165 170 175 

Ala Thr Asp Thr Ser Gly Tyr Ser Gly Thr Gly Thr Ser Gly Ser Val 
180 185 190 

Val Ser Gly Arg Val Ala Tyr Thr Leu Gly Leu Glu Gly Pro Ala Val 
195 200 205 

Thr Val Asp Thr Ala Cys Ser Ser Ser Leu Val Ala Leu His Leu Ala 
210 215 220 

Val Gin Ala Leu Arg Gly Gly Glu Cys Asp Met Ala Leu Ala Gly Gly 
225 230 235 240 

Val Thr Val Met Ala Gly Pro Gly Met Phe Val Glu Phe Ser Arg Gin 
245 250 255 

Arg Gly Leu Ala Ala Asp Gly Arg Cys Lys Ala Phe Ala Asp Gly Ala 
260 265 270 

Asp Gly Thr Ala Trp Ala Glu Gly Ala Gly Val Val Leu Val Glu Arg 
275 280 285 

Leu Ser Asp Ala Arg Arg Leu Gly His Pro Val Leu Ala Val Val Cys 
290 295 300 

Gly Ser Ala Val Asn Gin Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro 
305 310 315 320 

Ser Gly Pro Ser Gin Glu Arg Val He Arg Gin Ala Leu Ala Asn Ala 
325 330 335 

Arg Leu Thr Val Ala Asp Val Asp Val Val Glu Ala His Gly Thr Gly 
340 345 350 

Thr Arg Leu Gly Asp Pro He Glu Ala Gin Ala Leu Leu Gly Thr Tyr 
355 360 365 

Gly Arg Asp Arg Asp Gly Gly Arg Pro Val Trp Leu Gly Ser Leu Lvs 
370 375 380 

Ser Asn He Gly His Ala Gin Ala Ala Ala Gly Val Ala Gly Val He 
385 390 395 400 

Lys Met Val Leu Ala Met Arg Tyr Gly Trp Leu Pro Arg Thr Leu His 
405 410 415 
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Val Asp Glu Pro Ser Arg His Val Asp Trp Ser Ala Gly Gly Val Trp 
420 425 430 

Leu Leu Thr Glu Ala Arg Glu Trp Pro Gly Val Asp Arg Pro Arg Arg 
"5 440 445 

Ala Ala Val Ser Ala Phe Gly Val Ser Gly Thr Asn Ala His Leu He 
450 455 460 

Leu Glu Ala Pro Asp Thr Ala Glu Ala Glu Ser Ala Thr Thr Pro Val 
465 4 ?0 475 480 

Arg Ser Glu Val Ser Glu Ser Ala Ala Val Phe Asp Ala Arg Ser Gly 
485 490 495 

Val Val Pro Val Val Val Ser Gly Arg Ser Arg Val Val Val Arg Glu 
500 505 510 

Ala Ala Gly Arg Leu Ala Glu Val Val Glu Ala Gly Gly Val Gly Leu 
515 520 525 



Ala Asp Val Ala Val Thr Met Ala Gly Arg Ser Arg Phe Gly Tyr Arg 
b30 535 540 

Ala Val Val Leu Ala Arg Gly Glu Ala Glu Leu Ala Gly Arg Leu Arg 



550 



555 



560 



Ala Leu Ala Gly Gly Asp Pro Asp Ala Gly Val Val Thr Gly Ala Val 
565 570 



575 



Val Asp Pro Glu Thr Gly Ser Gly Gly Gly Gly Val Val Leu Val Phe 
580 585 590 

Pro Gly Gin Gly Thr Gin Trp Val Gly Met Gly Ala Gly Leu Leu Gly 
595 J 



600 



605 



Ser Ser Glu Val Phe Ala Ala Ser Met Arg Glu Cys Ala Arg Ala Leu 



615 



620 



Ser Val His Val Gly Trp Asp Leu Leu Glu Val Val Ser Gly Gly Ala 



630 



635 



640 



Gly Leu Glu Arg Val Asp Val Val Gin Pro Val Thr Trp Ala Val Met 
645 650 655 

Val Ser Leu Ala Arg Tyr Trp Gin Ala Met Gly Val Asp Val Ala Ala 
660 665 670 

Val Val Gly His Ser Gin Gly Glu He Ala Ala Ala Thr Val Ala Gly 
675 680 685 



Ala Leu Ser Leu Glu Asp Ala Ala Ala Val Val Ala Leu Arg Ala 
690 695 700 

Leu lie Gly Arg Tyr Leu Ala Gly Arg Gly Ala Met Ala Ala Val Pro 



710 



715 



720 



Leu Pro Ala Gly Glu Val Glu Ala Gly Leu Ala Lys Trp Pro Gly Val 



725 



730 



Gly 
735 
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Glu Val Ala Ala Val Asn Gly Pro Ala Ser Thr Val Val Ser Gly Asp 
740 745 750 

Arg Arg Ala Val Ala Gly Tyr Val Ala Val Cys Gin Ala Glu Gly Val 
755 760 765 

Gin Ala Arg Leu He Pro Val Asp Tyr Ala Ser His Ser Arg His Val 
770 775 780 

Glu Asp Leu Lys Gly Glu Leu Glu Arg Val Leu Ser Gly He Arg Pro 
785 790 795 800 

Arg Ser Pro Arg Val Pro Val Cys Ser Thr Val Ala Gly Glu Gin Pro 
80S 810 815 

Gly Glu Pro Val Phe Asp Ala Gly Tyr Trp Phe Arg Asn Leu Arg Asn 
820 825 830 

Arg Val Glu Phe Ser Ala Val Val Gly Gly Leu Leu Glu Glu Gly His 
835 840 845 

Arg Arg Phe He Glu Val Ser Ala His Pro Val Leu Val His Ala He 
fi 50 855 860 

Glu Gin Thr Ala Glu Ala Ala Asp Arg Ser Val His Ala Thr Gly Thr 
865 870 875 880 

Leu Arg Arg Gin Asp Asp Ser Pro His Arg Leu Leu Thr Ser Thr Ala 
885 890 895 

Glu Ala Trp Ala His Gly Ala Thr Leu Thr Trp Asp Pro Ala Leu Pro 
900 905 910 

Pro Gly His Leu Thr Thr Leu Pro Thr Tyr Pro Phe Asn His His His 
915 920 925 

Tyr Trp Leu Asp Thr Thr Pro Thr Thr Pro Ala Thr Thr Thr Gin Ser 
930 935 940 

Pro Thr Asp Ala Gin Asn Pro Ala Asp Ala Leu Pro Tyr Lys Val Ser 
945 950 955 960 

Trp Lys Arg Leu Arg Asp Gin Asp Ser Leu Thr Ala Arg Leu Asp Gly 
965 970 975 

Arg Trp Leu Leu Val Val Pro Glu Ala Ser Ala Asp Pro Ser Val Ala 
980 985 990 

Glu Gly Val Ala Arg Glu Leu Thr Ala Arg Gly Ala Thr Val Glu Ser 
"5 1000 1005 

Leu Thr Val Glu Pro Gly Ala Asp Arg Ser Arg Leu Arg Gly Leu Leu 
10 " 1015 1020 

Val Asp Ala Thr Glu Arg Asp Glu Ala Gly Pro Leu Arg Gly He Val 
1025 1030 1035 1040 

Ser Leu Leu Ala Leu Ala Gly Asp His Ala Gly Ala Asp Gly Ala Arg 
1045 1050 1055 
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Pro Val Val Pro Ala Gly Leu Ala Ala Ser Leu Ala Leu lie Gin Ala 
1060 1065 1070 

Ala Gly Asp Ala Gly Thr Glu Ala Gly Leu Trp Ala Val Thr Arg Gly 
1075 1080 1085 

Ala Val Ala Ala Val Pro Gly Asp Val Pro Ala Pro Ser Gin Ala Leu 
1090 1095 1100 

Leu Trp Gly Phe Gly Arg Val Ala Gly lie Glu Leu Pro His Cys Trp 
1105 1110 1115 1120 

Gly Gly Leu Leu Asp Leu Pro Thr Gly Pro Gly Asp Ser Gly Phe Arg 
1125 1130 1135 

Gin Leu Ala Ala Thr Leu Ala Gly Arg Pro Ala Glu Asp Gin Val Ala 
1140 1145 1150 

Leu Arg Ala Ser Gly Ala Tyr Gly Arg Arg Leu Val Arg Ala Ser Ala 
1155 1160 1165 

Ala Gly Gly Ala Asp Gly Trp Arg Pro Arg Gly Thr Val Leu Val Val 
1170 1175 1180 

Gly Asp Thr Ala Glu Val Ala Gly Pro Leu Val Arg Trp Leu Leu Gly 
1185 1190 1195 1200 

Asn Gly Ala Arg Arg Val Thr Leu Ser Gly Leu Ser Gly Pro Leu Pro 
1205 1210 1215 

Glu Glu Leu Ala Asp Val Ala Ala Arg Val Thr Val Ala Pro Cys Asp 
1220 1225 1230 

Pro Ala Asp Arg Pro Ala Leu Arg Thr Leu Leu Ala Glu Gin Ala Pro 
1235 '1240 1245 

Thr Ala Val Leu Val Ala Pro Pro Ala Val Pro Pro Thr Pro Leu Ala 
1250 1255 1260 

Glu Met Thr Ala Glu Ala Leu Ala lie Ala Leu Ser Ala Lys Thr Gly 
1265 1270 1275 1280 

Leu Val Asp Arg Leu Asp Ser Leu Leu Asp Glu Pro Asp Pro Leu Leu 
1285 1290 1295 

Glu Asp Gly Glu Leu Asp Ala Phe Val Val Phe Ser Ser Val Ala Gly 
1300 1305 1310 

Val Trp Gly Gly Ala Gly Gin Gly Gly Tyr Ala Ala Gly Thr Ala Tyr 
1315 1320 1325 

Leu Asp Ala Leu Ala Glu Cys Arg Arg Ala Gly Gly Leu Pro Val Thr 
1330 1335 1340 

Ser Val Ala Trp Thr Pro Trp Leu Gly Thr Pro Ala Ala Asp Ser Leu 
1345 1350 1355 1360 

Gly Glu Gin Met Ser Arg Ala Gly He Thr Pro Leu Asp Pro Ala Ala 
1365 1370 1375 



90 



EP 0 791 655 A2 



Ser Leu Asp Ala Leu Ala Arg Ala Val Gly Arg Arg Ala Gly Cys Val 
1380 1385 1390 

Thr Val Ala Asp He Asp Trp Glu Arg Phe Ala Ser Ala Tyr Thr Ala 
1395 1400 1405 

Thr Arg Pro Thr Pro Met Phe Asp Glu Val Pro Glu Val Arg Arg He 
1410 1415 1420 

Gin Ala Ala Trp Ala Glu Ala Glu Ala Asp Ala Ala Arg Ser Gly Ala 
1425 1430 1435 1440 

Gly Gly Asp Ser Gin Leu Leu Arg Ser Leu Arg Gly Arg Pro Glu Glu 
1445 1450 1455 

Ala Gin Leu Ala Glu Leu Leu Arg Leu Val Arg Thr His Ala Ala Ala 
1460 1465 1470 

Val Leu Gly Leu Gly Ser Pro Gly Ala Val Glu Ala Arg Arg Ser Phe 
1475 1480 1485 

Lys Asp Leu Gly Phe Asn Ser Val Thr Ala Val Glu Leu Arg Asn Arg 
1490 1495 1500 

Leu Lys Glu Ala Thr Gly Leu Arg Leu Glu Val Ser Leu Val Phe Asp 
1505 1510 1515 1520 

His Pro Asp Pro Ala Ser Leu Ala Arg His Leu Leu Asp Leu Ala Leu 
1525 1530 1535 

Gly Gin Glu Pro Glu Glu Thr Pro Arg Ala Phe Ala Leu Glu Pro Ala 
1540 1545 1550 

Pro Asn Gly Glu Pro He Ala He Val Ser Met Ala Cys Arg Met Pro 
1555 1560 1565 

Gly Gly Val Ser Thr Pro Glu Glu Leu Trp Arg Leu Leu Arg Asp Gly 
1570 1575 1580 

Lys Asp Ala He Gly Pro Phe Pro Ala Asn Arg Gly Trp Asp Leu Glu 
1585 1590 1595 1600 

Asn Leu Tyr Asp Pro Asp Pro Asp Ala Asp Gly Arg Thr Tyr Val Arg 
1605 1610 " 1615 

Glu Gly Gly Phe Leu His Glu Ala Pro Asp Phe Asp Pro Ser Phe Phe 
1620 1625 1630 

Gly He Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gin Gin Arg Leu 
1635 1640 1645 

Leu Leu Glu Thr Ser Trp Glu Ala Leu Glu Arg Ala Gly He Asp Pro 
1650 1655 1660 

Ala Arg Leu Arg Gly Ser Arg Thr Gly Val Phe Val Gly Thr Asn Gly 
1665 1670 1675 i6 80 

Gin His Tyr Met Pro Leu Leu Gin Asn Gly Gly Asp Ser Phe Asp Gly 
1685 1690 1695 
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Tyr Leu Gly Thr Gly Asn Ser Ala Ser Val Met Ser Gly Arg Leu Ser 
1700 1705 X710 

Tyr Val Phe Gly Leu Glu Gly Pro Ala Val Thr Val Asp Thr Ala Cys 
1715 1720 1725 

Ser Ala Ser Leu Val Ala Leu His Leu Ala Val Gin Ala Met Arg Arg 
1730 1735 1740 

Gly Glu Cys Asp Met Ala Leu Val Gly Gly Ala Thr Val Met Ser Thr 
1745 1750 1755 1760 

Pro Glu Met Leu Val Glu Phe Ser Arg Gin Arg Val He Ser Ala Asn 
1765 1770 1775 

Gly Arg Ser Arg Ala Phe Ala Ala Gly Ala Asp Gly Val Ala Leu Gly 
1780 1785 1790 

Glu Gly Val Gly Val Leu Leu Val Glu Arg Leu Ser Asp Ala Glu Arg 
1795 1800 1805 

Asn Gly His Pro Val Leu Ala Val Val Arg Gly Ser Ala Val Asn Gin 
1810 1815 1820 

Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro Ser Gin Gin 
1825 1830 1835 1840 

Arg Val He Arg Gin Ala Leu Ala Asp Ala Gly Leu Arg Pro Glu Asp 
1845 1850 1855 

He Asp Ala Val Glu Ala His Gly Thr Gly Thr Glu Leu Gly Asp Pro 
I860 1865 1870 

He Glu Ala Glu Ala Leu Leu Ala Thr Tyr Gly Arg Thr Arg Thr Ala 
1875 1880 1885 

Asp Arg Pro Leu Trp Leu Gly Ser Leu Lys Ser Asn He Gly His Thr 
1890 1895 1900 

Gin Ala Ala Ala Gly Val Ala Gly Val He Lys Met Val Leu Ala Leu 
1905 1910 1915 1920 

Gly Asn Glu Thr Leu Pro Arg Thr Leu His Val Asp Glu Pro Thr Pro 
1925 1930 1935 

Arg Val Asp Trp Ser Ser Gly Ala Val Ser Leu Leu Thr Glu Pro Val 
1940 1945 1950 

Asp Trp Pro Ala Gly Pro Ser Ala Pro Arg Arg Ala Ala Val Ser Ser 
1955 i960 1965 

Phe Gly He Ser Gly Thr Asn Ala His Thr He Leu Glu Gin Ala Pro 
1970 1975 1980 

Val Pro Ala Glu Ser Arg Pro Gly Thr Glu Pro Ala Asp Gly Thr Gly 
1985 1990 1995 * 2000 

Ala Trp Glu Asn Val Thr Val Pro Leu Leu Leu Ser Gly His Thr Glu 
2005 2010 2015 
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Ala Ala Leu Arg Glu Gin Ser Thr Arg Leu Leu Asn Asp Leu Leu Glu 
2020 2025 2030 

His Pro Asp Glu His Pro Ala Asp Val Gly Tyr Thr Leu He Thr Gly 
2035 2040 2045 

Arg Ala His Phe Gly His Arg Ala Ala Val He Gly Glu Ser Arg Glu 
2050 2055 2060 

Glu Leu Leu Asp Ala Leu Lys Ala Leu Ala Glu Gly Arg Glu His His 
2065 2070 2075 ~ 2080 

Thr Val Val Arg Gly Asp Gly Thr Ala His Pro Asp Arg Arg Val Val 
2085 2090 2095 

Phe Val Phe Pro Gly Gin Gly Ser Gin Trp Pro Ser Met Ala Arg Asp 
2100 2105 2110 

Leu Leu Asp Arg Ala Pro Ala Phe Arg Glu Thr Ala Lys Ala Cys Asp 
2H5 2120 2125 

Ala Ala Leu Ser Val His Leu Asp Trp Ser Val Leu Asp Val Leu Gin 
2130 2135 2140 

Glu Lys Pro Asp Ala Pro Pro Leu Ser Arg Val Asp Val Val Gin Pro 
2145 2150 2155 2160 

Val Leu Phe Thr Met Met Leu Ser Leu Ala Ala Cys Trp Arg Asp Leu 
2165 2170 2175 

Gly Val His Pro Ala Ala Val Val Gly His Ser Gin Gly Glu He Ala 
2180 2185 2190 

Ala Ala Cys Val Ala Gly Ala Leu Ser Leu Glu Asp Ala Ala Arg He 
2195 2200 2205 

Val Ala Leu Arg Ser Arg Ala Trp Leu Thr Leu Ala Gly Lys Gly Gly 
2210 2215 2220 

Met Ala Ala Val Ser Leu Pro Glu Ala Arg Leu Arg Glu Arg He Glu 
2225 2230 2235 2240 

Arg Phe Gly Gin Arg Leu Ser Val Ala Ala Val Asn Ser Pro Gly Thr 
2245 2250 2255 

Ala Ala Val Ala Gly Asp Val Asp Ala Leu Arg Glu Leu Leu Ala Glu 
2260 2265 2270 

Leu Thr Ala Glu Gly He Arg Ala Lys Pro He Pro Gly Val Asp Thr 
2275 2280 2285 

A1S ^L" iS Ser Ala Gln Val Asp Glv Leu Gl« His Leu Phe Glu 
2290 2295 2 300 

Val Leu Ala Pro Val Ser Pro Arg Ser Ser Asp He Pro Phe Tyr Ser 
2305 2310 2315 2320 

Thr Val Thr Gly Ala Pro Leu Asp Thr Glu Arg Leu Asp Ala Gly Tyr 
2325 2330 2335 
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10 



15 



20 



Trp Tyr Arg Asn Met Arg Glu Pro Val Glu Phe Glu Lys Ala Val Arg 
2340 2345 2350 

Ala Leu He Ala Asp Gly Tyr Asp Leu Phe Leu Glu Cys Asn Pro His 
2355 2360 2365 

Pro Met Leu Ala Met Ser Leu Asp Glu Thr Leu Thr Asp Ser Gly Gly 
2370 2375 2380 

His Gly Thr Val Met His Thr Leu Arg Arg Gin Lys Gly Ser Ala Lys 
2385 2390 2395 2400 

Asp Phe Gly Met Ala Leu Cys Leu Ala Tyr Val Asn Gly Leu Glu He 
2405 2410 2415 

Asp Gly Glu Ala Leu Phe Gly Pro Asp Ser Arg Arg Val Asn Pro Pro 
2420 2425 2430 

Thr Tyr Pro Phe Gin Arg Glu Arg Tyr Trp *iyr His Pro Thr Ser Gly 
2435 2440 2445 

Arg Arg Gly Asp He Thr Ala Ala Gly Val Ala Glu Ala Glu His Pro 
2450 2455 2460 

Leu Leu Gly Ala Gly Val Glu Leu Pro Glu Thr Gly Gly Thr Val Tyr 
25 2465 2470 2475 2480 

Thr Ala Arg Phe Gly Pro Asp Ser Arg Pro Trp Leu Ala Asp His Ala 
2485 2490 2495 

30 Leu Leu °ly Thr Val Leu Leu Pro Gly Thr Ala He Leu Asp Leu Val 

2500 2505 2510 

Leu Trp Ala Gly Glu Arg Ser Gly Cys Gly Arg Val Gly Glu Leu Ala 
2515 2520 2525 

35 Leu Gin Ala Pro Leu Val Leu Pro Asp Ser Gly Asp Val Glu Leu Arg 

2530 2535 2540 

Leu Leu Val Gly Gly Pro Asp Glu Glu Lys Arg Arg Thr Val Thr Val 
2545 2550 2555 2560 

His Ala Arg Pro Ala Ala Ala Gly Ala Glu Ala Pro Trp Thr Arg His 
2565 2570 2575 

Ala Glu Ala Val Val Leu Pro Ala Thr Gly Glu Glu Pro Thr Pro Ala 
2580 2585 2590 

Pro Arg Pro Val Pro Glu Pro Ala Gly Thr Thr Asp Pro Ala Ala Phe 
2595 2600 2605 

Tyr Ala Glu Phe Ala Glu Arg Gly Tyr Asp Tyr Gly Pro Ala Phe Gin 
50 2610 2615 2620 

Gly Phe Thr Ala Gly Ala Arg His Gly Glu Asp Val Val Ala Glu Val 
2625 2630 2635 2640 

55 Ala Leu Pro Ser Gly Leu Val Ala Asp Ala Arg His His Arg Leu His 

2645 2650 2655 



40 



45 
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Pro Ala Leu Leu Asp Ala Ala Leu Gin Ala Met He Leu Gly Thr Phe 
2660 2665 2670 

Phe Ala Asp Asp Gly Arg Ala Arg Met Pro Phe Ala Val Arg Gly Val 
2675 . 2680 2685 

Arg Leu His Thr Ala Gly Ala Asp Arg Leu Arg Val Leu He Ser Pro 
2690 2695 2700 

Ala Gly Asp Glu Thr Val Arg Leu Leu Cys Thr Asp Leu Ala Thr Gly 
2705 2710 2715 2720 

Ala Pro Val Leu Glu He Asp Glu Leu Val Val Arg Pro Val Ser Gly 
2725 2730 2735 

Glu Gin Leu Ala Ala Gly Ala Pro Gly Arg Asn Gly Gly Glu Leu Tyr 
2740 2745 2750 

Arg Val Asp Trp Thr Val Leu Pro Glu Pro Ala Glu Val Pro Ala Pro 
2755 2760 2765 

Arg Trp Ala Leu Leu Gly Glu Asp His Ala Gly Leu Ala Asp Val Leu 
2770 • 2775 2780 

Gly Gly Thr Gly Gly Gly Cys Glu Arg Tyr Asp Thr Leu Thr Gly Leu 
2785 2790 2795 2800 

Leu Glu Ala Thr Thr Arg Ser Ala Gly Gly He Leu Pro Asp He Val 
2805 2810 2815 

Ala Leu Ser Leu Pro Thr Ala Pro Glu Pro Gly Pro Gin Ala Val Arg 
2820 2825 2830 

Glu Val Leu Ser Gin Ala Leu Asp Ala Ala Gin Ala Trp Leu Ala Ala 
2835 2840 2845 

Gly Ala Glu Thr Ala Ser Ala Arg Leu Val Phe Val Thr Gly Gly Ala 
285 0 2855 2860 

Val Ala Thr Thr Ala Asp Glu Thr Val Arg Asp He Ala Ala Ala Ala 
2865 2870 2875 2880 

Val Trp Gly Leu Val Arg Ser Ala Gin Ser Glu Glu Pro Asp Arg Met 
2885 2890 2895 

Val Leu Leu Asp Leu Asp Gly Glu Arg Pro Thr Ala Arg Thr Leu Ala 
2900 2905 2910 

Ala Ala Leu Ala Ser Gly Glu Pro Gin Leu Ala Val Arg Gly Ser Thr 
2915 2920 2925 

Val ^/ la Pr ° Leu Ala Pro Ala G1 V Pro G ^y ^o Glu Asp Leu 

2930 2935 2940 

^L Pr ° Pr ° Ala Gly Thr ^ Ala Tr P Ar 9 Leu Tnr Pro Gly Gly Gly 
2945 2950 2955 * 2 960 

Thr Leu Glu Glu Leu Ser Leu Ala Pro Ala Pro Asp Ala Glu Glu Pro 
2965 2970 2975 
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Leu Ala Pro Gly Gin Val Arg He Ala Val Arg Ala Ala Gly Val Asn 
2980 2985 2990 

Phe Arg Asp Ala Leu He Ala Leu Gly Met Tyr Pro Gly Lys Gly Thr 
2995 3000 3005 

Met Gly Ala Glu Gly Ala Gly Val Val Val Glu Thr Ala Pro Asp Val 
3010 3015 3020 

Thr Gly Leu Ser Ala Gly Asp Arg Val Leu Gly Met Trp Asn Gly Gly 
3 025 3030 3035 *" 3040 

Phe Gly Pro Leu Val Val Ala Asp His Arg Met Val Ala Pro He Pro 
3045 3050 3055 

His Gly Trp Ser Tyr Ala Glu Ala Ala Ser Val Pro Ala Val Leu Leu 
3060 3065 3070 

Thr Ser Tyr Tyr Ala Leu Thr Arg Leu Ala Arg Ala Arg Thr Gly Gin 
20 3 0 7 5 3 0 8 0 3 0 85 

Thr Val Leu Val His Ala Ala Ala Gly Gly Val Gly Met Ala Thr Leu 
3090 3095 3100 



TO 



75 



Gin Leu Ala Arg His Leu Gly Leu Glu Val Tyr Ala Thr Ala Ser Thr 
3105 3110 3115 3120 

Gly Lys Trp Asp Ala Leu Gin Lys His Gly He Pro Asp Asp Arg He 
3125 3130 3135 

Ala Asp Ser Arg Thr Leu Asp Phe Ala Glu Arg Phe Leu Ser Arg Thr 
3140 3145 3150 

Gly Gly Arg Gly Val Asp He Val Leu Asn Ser Leu Ala Gly Glu Phe 
3155 3160 3165 

Val Asp Ala Ser Leu Arg Leu Leu Pro Arg Gly Gly His Phe Leu Glu 
3170 3175 3180 

Leu Gly Lys Ala Asp Val Arg Asp Pro Arg Arg He Ala Ala Ala His 
3185 3190 3195 3200 

Pro Gly Thr Asp Tyr Arg Ala Phe Asp Leu Val Gin Ala Gly Pro Asp 
3205 3210 3215 

Thr Val Gly Glu Met Leu Gly Glu Leu Leu Glu Leu Phe Ala Ala Gly 
3220 3225 3230 

Ala Leu Arg Pro Leu Pro Leu Thr Ala Tyr Gly lie Arg Asp Ala Arg 
3235 3240 3245 

Thr Ala Leu Thr Leu Ser Gin Ala Arg His Thr Gly Lys Leu Val 

50 3250 3255 3260 

Leu Thr Val Pro Ala Gly Phe Asp Thr His Arg Thr Val Leu Leu Thr 
3265 3270 3275 3280 

55 Gly Gly Thr Gly Thr Leu Gly Gin Thr Leu Ala Arg His Leu Val Asn 

3285 3290 3295 



25 



30 



35 



40 



45 
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w 



15 



20 



Arg His Gly Val Arg His Leu Leu Leu Ala Gly Arg Thr Gly Ala Ala 
3300 3305 3310 

Ala Glu Gly Val Ala Glu Leu He Gly Glu Leu Gly Glu Leu Gly Ala 
3315 3320 3325 

Glu Val Arg Val Ala Ala Cys Asp Ala Ala Asp Arg Gin Arg Leu Thr 
3330 3335 3340 

Glu Leu Leu Ala Gly He Pro Val Glu His Pro Leu Gly Ala Val Val 
3345 3350 3355 3360 

His Ala Ala Gly Thr Leu Asp Asp Gly Thr He Pro Ser Leu Thr Gly 
3365 3370 3375 

Glu Asn He Asp Asn Val Leu Arg Pro Lys Ala Asp Ala Val Leu Asn 
3380 3385 3390 

Leu His Glu Leu Thr Arg Asp Ala Asp Leu Ser Ala Phe Val Leu Tyr 
3395 3400 3405 

Ser Ser Ser Ser Ala Leu Leu Gly Ser Pro Gly Gin Gly Ala Tyr Ala 
3410 3415 3420 

Ala Ala Asn Ala Phe Leu Asp Gly Phe Ala Arg Tyr Arg Lys Gly Leu 
25 3425 3430 3435 " 3440 

Gly Leu Pro Ala Leu Ser Leu Ala Trp Gly Leu Trp Gly Ser Asn Ser 
3445 3450 * 3455 

30 Arg Met Ala Gly His Leu Asp Gin Ser Gly Met Gin Arg Arg Leu Asn 

3460 3465 3470 

Arg Ser Gly He Met Ala Leu Thr Asp Ala Glu Gly Leu Ala Leu Phe 
3475 3480 3485 

Asp Ala Ala Gin Asp Gly Gly Asp Ala Leu Leu Val Pro Met Arg Leu 
3490 3495 3500 

Asn Arg Thr Ala Leu Arg Ala Ser Gly Arg He Thr Pro Phe Leu Ser 
3505 3510 3515 3520 

Gly Leu Ala Gly Gly Gly Pro Ala Ala Gly Glu Arg Arg Pro Glu Val 
3525 3530 3535 

Ala Ala Val Ser Gly Thr Leu Ala Glu Arg Leu Thr Gly Leu Thr Ala 
3540 3545 3550 

Gin Glu Gly His Ala Leu Val Leu Ala Glu He Arg Ala His Ala Ala 
3555 3560 3565 

Ala Val Leu Gly His Gly Ser Asp Asp Ser He Pro Glu Asp Arg Ala 
50 3570 3575 3580 

Phe Lys Asp Leu Gly Phe Asp Ser Leu Thr Ala Val Glu Met Arg Asn 
3 *85 3590 3595 3600 

55 Arg Leu Ser Ala Ala Thr Gly Leu Arg Leu Pro Ala Thr Leu Val Phe 

3605 3610 3615 



40 
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Asp His Pro Thr Pro Gly Glu Leu Ala Gly His Leu Ser Ala Glu Leu 
3620 3625 3630 

Ser Ala Asp Asp Ala Pro Gly Ser Ala Ser Pro Leu Thr Glu Leu Asp 
3635 3640 3645 

Arg Phe Glu Ala Leu Phe Thr Ala Leu Ala Pro Gly Thr Thr Lys Asp 
3650 3655 3660 

Thr Pro Gly Gly Ala Gly Ala Leu Met lie Asp Glu Ala Glu Arg Gin 
3665 3670 3675 3680 

Glu He Ala Gly Arg Leu Ala Ala Leu Ala Gly Leu Trp Asn Arg Leu 
3685 3690 3695 

His Gly Thr Thr Thr Ala Pro Glu Asp Gly Asp Thr Val Ala Asp Ala 
3700 3705 3710 

Leu Glu Ala Ala Asp Asp His Glu He Phe Ala Phe Leu Asp Glu Arg 
3715 3720 3725 

Phe * 

3730 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1612 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:5: 

Met Ala Asn Ala Asn Glu Gin Gin Leu Arg Ala Tyr Leu Lys Arg Ala 
1 5 10 15 

Thr Thr Glu Leu His Arg Thr Ser Glu Gin Leu Arg Glu Glu Arg Ala 
20 25 30 

Arg Ala His Glu Pro He Ala Val Val Gly Met Ala Cys Arg Tyr Pro 
35 40 45 

Gly Gly Ala Asn Thr Pro Glu Gin Phe Trp Glu Leu Leu Asp Thr Gly 
50 55 60 

Thr Asp Ala Ala Ala Pro Met Pro Ser Asp Arg Gly Trp Asp Thr His 
65 70 75 80 

Gly Leu Tyr Asp Pro Asp Pro Ala Ala Ala Gly Arg Thr Tyr Cys Arg 
85 90 95 

Glu Gly Gly Phe Leu His Asp Ala Gly Asp Phe Asp Ala Asp Phe Phe 
100 105 110 

Gly He Ser Pro Arg Glu Ala Val Ala Met Asp Pro Gin Gin Arg Leu 
115 120 125 
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Leu Leu Glu Thr Ser Trp Glu Ala lie Glu Ala Ala Gly lie Asp Pro 
130 135 140 

Arg Gly Leu Arg Gly Ser Arg Thr Gly Val Tyr Val Gly Ala Trp Asp 
145 150 155 160 

Ser Gly Tyr Thr Gly Gin Ala His Ala Pro Ser Ala Glu Leu Glu Ala 
165 170 175 

Asp Leu Leu Thr Gly Gly Val Val Ser Phe Thr Ser Gly Arg He Ala 
180 185 190 

Tyr Thr Leu Gly Leu Glu Gly Pro Ala Leu Thr Val Asp Thr Ala Cys 
195 200 205 

Ser Ser Ser Leu Val Ala Leu His Asn Ala Ala Gin Ala Leu Arg Arg 
210 215 220 

Gly Glu Cys Asp Leu Ala Leu Ala Gly Gly Val Thr Val Met Ala Thr 
225 230 235 240 

Pro Ala Val Phe Val Gin Phe Ala Arg Gin Arg Gly Leu Ala Pro Asp 
245 250 255 

Gly Arg Cys Lys Ala Phe Ala Asp Ala Ala Asp Gly Phe Gly Pro Ala 
260 265 270 

Glu Gly Val Gly Met Val Leu Val Glu Arg Leu Ser Asp Ala Arg Arg 
275 280 285 

Leu Gly His Pro Val Leu Ala Val Val Cys Gly Ser Ala Val Asn Gin 
290 295 300 

Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Ser Gly Pro Ser Gin Glu 
305 310 315 * 320 

Arg Val He Arg Gin Ala Leu Gly Asn Ala Arg Leu Thr Val Ala Asp 
325 330 335 

Val Asp Val Val Glu Ala His Gly Thr Gly Thr Arg Leu Gly Asp Pro 
340 345 350 

He Glu Ala Gin Ala Leu Leu Gly Thr Tyr Gly Arg Asp Arg Asp Gly 
355 3 60 " 365 

Gly Arg Pro Val Trp Leu Gly Ser Leu Lys Ser Asn He Gly His Ala 
370 375 380 

Gin Ala Ala Ala Gly Val Ala Gly Val lie Lys Met Val Leu Ala Met 
385 390 395 400 

Arg Tyr Gly Trp Leu Pro Arg Thr Leu His Val Asp Glu Pro Ser Arg 
405 410 415 

His Val Asp Trp Ser Ala Gly Gly Val Arg Leu Leu Thr Glu Ala Arg 
420 425 430 



Glu Trp Pro Gly Val Asp Arg Pro Arg Arg Ala Ala Val Ser Ala Phe 
435 440 445 
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Gly Val Ser Gly Thr Asn Ala His Leu lie Leu Glu Ala Pro Asp Thr 
450 455 460 

Ala Glu Ala Glu Ser Ala Thr Thr Pro Val Arg Ser Glu Val Ser Glu 

465 470 475 480 



70 



Ser Ala Ala Val Leu Asp Ala Arg Ser Gly Val Val Pro Val Val Val 
485 490 495 

Ser Gly Arg Ser Arg Val Val Val Arg Glu Ala Ala Gly Arg Leu Ala 
500 505 510 



75 



Glu Val Val Glu Ala Gly Gly Val Gly Leu Ala Asp Val Ala Val Thr 
515 520 525 

Met Ala Gly Arg Ser Arg Phe Gly Tyr Arg Ala Val Val Leu Ala Arg 
530 535 540 



20 



Gly Glu Ala Glu Leu Ala Gly Arg Leu Arg Ala Leu Ala Gly Gly Asp 
545 550 555 560 



25 



Pro Asp Ala Gly Val Val Thr Gly Ala Val Val Asp Pro Glu Thr Gly 
565 " 570 575 

Ser Gly Gly Gly Gly Val Val Leu Val Phe Pro Gly Gin Gly Thr Gin 
580 585 590 



30 



Trp Val Gly Met Gly Ala Gly Leu Leu Gly Ser Ser Glu Val Phe Ala 
595 600 605 

Ala Ser Met Arg Glu Cys Ala Arg Ala Leu Ser Val His Val Gly Tro 
610 615 620 



35 



40 



45 



50 



Asp Leu Leu Glu Val Val Ser Gly Gly Ala Gly Leu Glu Arg Val Asp 
625 «0 635 640 

Val Val Gin Pro Val Thr Trp Ala Val Met Val Ser Leu Ala Arg Tyr 
6 *5 650 655 

Trp Gin Ala Met Gly Val Asp Val Ala Ala Val Val Gly His Ser Gin 
660 665 670 

Gly Glu He Ala Ala Ala Thr Val Ala Gly Ala Leu Ser Leu Glu Asp 
67 5 680 685 

Ala Ala Ala Val Val Ala Leu Arg Ala Gly Leu He Gly Arg Tyr Leu 
fi 90 695 700 

Ala Gly Arg Gly Ala Met Ala Ala Val Pro Leu Pro Ala Gly Glu Val 
705 710 715 7 20 

Glu Ala Gly Leu Ala Lys Trp Pro Gly Val Glu Val Ala Ala Val Asn 
725 730 735 

Gly Pro Ala Ser Thr Val Val Ser Gly Asp Arg Arg Ala Val Ala Gly 
740 745 750 



55 



lVr Val Ala Val Cys Gin Ala Glu Gly Val Gin Ala Arg Leu He Pro 
755 760 765 



100 



EP0 791 655 A2 



Val Asp Tyr Ala Ser His Ser Arg His Val Glu Asp Leu Lys Gly Glu 
770 775 780 

Leu Glu Arg Val Leu Ser Gly He Arg Pro Arg Ser Pro Arg Val Pro 
785 790 795 800 

Val Cys Ser Thr Val Ala Gly Glu Gin Pro Gly Glu Pro Val Phe Asp 
805 810 815 

Ala Gly Tyr Trp Phe Arg Asn Leu Arg Asn Arg Val Glu Phe Ser Ala 
820 825 830 

Val Val Gly Gly Leu Leu Glu Glu Gly His Arg Arg Phe He Glu Val 
835 840 845 

Ser Ala His Pro Val Leu Val His Ala He Glu Gin Thr Ala Glu Ala 
850 855 860 

Ala Asp Arg Ser Val His Ala Thr Gly Thr Leu Arg Arg Gin Asp Asp 
20 865 870 875 880 

Ser Pro His Arg Leu Leu Thr Ser Thr Ala Glu Ala Trp Ala His Gly 
885 890 895 



w 



15 



25 



Ala Thr Leu Thr Trp Asp Pro Ala Leu Pro Pro Gly His Leu Thr Thr 
900 905 910 

Leu Pro Thr Tyr Pro Phe Asn His His His Tyr Trp Leu Asp Thr Thr 
915 920 925 

30 Pro Thr Thr Pro Ala Thr Thr Thr Gin Ser Pro Thr Asp Ala Trp Arg 

930 935 940 

Tyr Arg Val Thr Trp Lys Ala Leu Thr Glu Glu Ser Thr Pro Ala Ser 
945 950 955 960 

35 

Ser Pro Ser Gly His Trp Leu Leu Val Thr Pro Pro Thr Pro Glu Gly 
965 970 975 

Arg Thr Leu Gly Asp Arg Ala Ala Gly Ala Leu Ala Arg Gin Gly Ala 
980 985 990 

Thr Val Glu Arg Leu Val Val Asp Pro Val Ala Val Gly Arg Asp Gly 
995 1000 1005 

Leu Ala Ala Arg Leu Gly Glu Arg Trp Asp Gly Val Leu Ser Leu Leu 
1010 1015 1020 

Gly Ala Asp Glu Arg Pro Leu Pro Arg His Pro Ala Leu Asn Arg Ala 
1025 1030 1035 1040 

Val Met Gly Thr Thr Leu Leu Ala Gin Ala Ala Leu Asp Ala Gly Cys 
50 1045 1050 1055 

Glu Ala Arg He Trp Ala Val Thr Arg Glu Ala Val Ala Val Ser Pro 
1060 1065 1070 

55 Ser Glu Val Pro Arg Asp Ala Gly Ala Gin Leu Trp Gly Leu Gly Arg 

1075 1080 1085 



40 



45 
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Gly lie Ala Leu Glu His Pro Ser Leu Trp Gly Cly Leu He Asp Leu 
1090 1095 1100 

Pro Ala Val Pro Asp Glu Arg Ala Trp Ala Arg Ala Val Arg Arg Leu 
1105 1110 1115 1120 

Val Pro His Gly Glu Asp Gin He Ala Ala Arg Ala Ser Gly Ala Tyr 
1125 1130 1135 

Gly Arg Arg Leu Leu Pro Ala Pro Pro Ala Ala Ser Arg Arg Thr Cys 
1140 1145 1150 

Thr Pro Ser Gly Thr Val Leu Val Thr Gly Gly Thr Gly Ala Leu Gly 
1155 1160 1165 

Gly His Leu Ala Arg Arg Leu Ala Arg Gly Gly Thr Gly His Leu Val 
1170 1175 1180 

Leu Thr Ser Arg Arg Gly Pro Asp Ala Pro Gly Ala Gly Glu Leu Ala 
1185 1190 1195 1200 

Gly Glu Leu Ala Ser Leu Gly Ala Lys Val Thr Val Ala Ala Cys Asp 
1205 1210 1215 

Met Ala Asp Arg Glu Ala Val Arg Ala Leu Leu Asp Glu His Arg Pro 
1220 1225 1230 

Thr Ala Val Phe His Thr Ala Gly Thr Pro His Ser Ala Glu Phe Thr 
1235 1240 1245 

Ala Leu Asp Glu Thr Thr Thr Ala Gly Val Tyr Gly Gly Lys Val Leu 
1250 1255 1260 

Gly Ala Arg His Leu Asp Glu Leu Thr Arg Glu Leu Gly He Gly Leu 
1265 1270 1275 1280 

Asp Ala Phe Val Leu Phe Ser Ser Gly Ala Ala Val Trp Gly Ser Gly 
1285 1290 1295 

Gly Gin Thr Ala Tyr Gly Ala Ala Asn Ala Ala Leu Asp Ala Leu Ala 
1300 1305 1310 

Glu Arg Arg Arg Ala Ala Gly Leu Pro Ala Thr Ser Val Ala Trp Gly 
1315 1320 1325 

Leu Trp Gly Gly Gly Gly Met Gly Glu Gly Asp Gly Glu Glu Phe Leu 
1330 1335 1340 

Ser Arg Arg Gly Leu Gly Val Met Pro Pro Glu Asp Ala Leu Glu Ala 
1345 1350 1355 1360 

Leu Asp Arg Ala Leu Asp Arg Glu Asp Thr Thr Val Val Val Ala Asp 
1365 1370 1375 

Val Asp Trp Glu Arg Phe Ala Pro Ala Phe Thr Ala Phe Arg Pro Ser 
1380 1385 1390 

Ala Leu He Ser Arg Leu Val Ser Asp Gly Gly Glu Ala Gly Gly Gin 
1395 1400 1405 
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Asp Ala Pro Asp Gly Thr Leu Phe Ala Ala Gly Phe Ala Ala Ala Gly 
1410 1415 1420 

Pro Leu Glu Arg Gin Glu Met Leu Leu Gly Leu Val Arg Arg His Val 
1425 1430 1435 1440 

Ala Ala Val Leu Gly His Pro Gly Thr Ala Asp lie Gly Pro Asp Arg 
1445 1450 1455 

Ala Phe Lys Glu Leu Gly Phe Ser Ser Val Thr Ala Val Glu Leu Ala 
1460 1465 1470 

Gly Arg Leu Gly Arg Glu Cys Gly Arg Lys Leu Pro Pro Thr Leu Val 
1475 1480 1485 

Phe Asp His Pro Thr Ala Ala Ala Ala Val Glu His Leu Ala Glu Leu 
1490 1495 1500 

Leu Thr Pro Pro Ala Gly Pro Ala Ala Gly Pro Arg Glu Glu Glu Ala 
1505 1510 1515 1520 

Arg Ala Ala Leu Ala Arg Val Pro Leu Glu Arg Leu Arg Glu Ala Gly 
1525 1530 " 1535 

Leu Leu Asp Ala Leu Leu Arg Leu Ala Ala Asp Glu Ser Gly Ala Thr 
1540 1545 1550 

Thr Pro Arg Thr Ser Ala Ala Ser Gly Ala Pro Arg Gly Arg Glu Glu 
1555 1560 1565 

Pro Asp Gly Arg Gly Glu Pro Asp Gly Ser Gly His Arg Glu Ser Pro 
1570 1575 1580 

Asp Ala Ala Gly Gly Ser Asp Ala Leu Asp Asp Leu Asp Gly Asp Ala 
1585 1590 1595 1600 

Leu Val Arg Leu Ala Leu Gly Glu Pro Gly Glu * 
1605 1610 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1842 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Ala Met Ser Ala Glu Arg Leu Thr Glu Ala Leu Arg Thr Ser Leu 
15 10 15 

Lys Glu Ala Glu Arg Leu Arg Arg Gin Asn Arg Glu Leu Arg Ala Ala 
20 25 30 

Arg Asp Ala Ala Arg Clu Pro lie Ala Val Val Gly Met Ala Cys Arg 
35 40 45 
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Tyr Pro Gly Gly Val Thr Gly Pro Glu Glu Leu Trp Glu Leu Val Ala 
50 . 55 60 



Gly Gly Arg Asp Ala He Gly Pro Phe Pro Val Asp Arg Gly Trp Asp 
65 70 75 80 

Val Ala Ser Val Tyr Asp Pro Asp Pro Glu Ser Lys Gly Thr Thr Tyr 
85 90 95 

Cys Arg Glu Gly Gly Phe Leu Glu Gly Ala Gly Asp Phe Asp Ala Ala 
100 105 no 

Phe Phe Gly He Ser Pro Arg Glu Ala Leu Val Met Asp Pro Gin Gin 
115 120 125 

Arg Leu Leu Leu Glu Val Ser Trp Glu Ala Leu Glu Arg Ala Gly He 
130 135 i 4 o 

Asp Pro Ser Ser Leu Arg Gly Ser Arg Gly Gly Val Tyr Val Gly Ala 
145 150 155 160 

Ala His Gly Ser Tyr Ala Ser Asp Pro Arg Leu Val Pro Glu Gly Ser 
165 170 175 

Glu Gly Tyr Leu Leu Thr Gly Ser Ala Asp Ala Val Met Ser Gly Arg 
ISO 185 190 

He Ser Tyr Ala Leu Gly Leu Glu Gly Pro Ser Met Thr Val Glu Thr 
1^5 200 205 

Ala Cys Ser Ser Ser Leu Val Ala Leu His Leu Ala Val Arg Ala Leu 
210 215 220 

Arg His Gly Glu Cys Gly Leu Ala Leu Ala Gly Gly Val Ala Val Met 
225 230 235 240 

Ala Asp Pro Ala Ala Phe Val Glu Phe Ser Arg Gin Lys Gly Leu Ala 
24 5 250 * 255 

Ala Asp Gly Arg Cys Lys Ala Phe Ser Ala Ala Ala Asp Gly Thr Gly 
260 265 270 

Trp Ala Glu Gly Val Gly Val Leu Val Leu Glu Arg Leu Ser Asp Ala 
275 280 285 

Arg Arg Ala Gly His Thr Val Leu Gly Leu Val Thr Gly Thr Ala Val 
290 295 300 

Asn Gin Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro Ala 
305 310 315 320 

Gin Gin Arg Val He Ala Glu Ala Leu Ala Asp Ala Gly Leu Ser Pro 
325 330 335 

Glu Asp Val Asp Ala Val Glu Ala His Gly Thr Gly Thr Arg Leu Gly 
340 345 350 

Asp Pro He Glu Ala Gly Ala Leu Leu Ala Ala Ser Gly Arg Asn Arg 
355 350 365 
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Ser Gly Asp His Pro Leu Trp Leu Gly Ser Leu Lys Ser Asn lie Gly 
370 375 380 

His Ala Gin Ala Ala Ala Gly Val Gly Gly Val lie Lys Met Leu Gin 
385 390 395 400 

Ala Leu Arg His Gly Leu Leu Pro Arg Thr Leu His Ala Asp Glu Pro 
405 410 415 

Thr Pro His Ala Asp Trp Ser Ser Gly Arg Val Arg Leu Leu Thr Ser 
420 425 430 



75 



Glu Val Pro Trp Gin Arg Thr Gly Arg Pro Arg Arg Thr Gly Val Ser 
435 440 445 



20 



Ala Phe Gly Val Gly Gly Thr Asn Ala His Val Val Leu Glu Glu Ala 
450 455 460 

Pro Ala Pro Pro Ala Pro Glu Pro Ala Gly Glu Ala Pro Gly Gly Ser 
465 470 475 480 

Arg Ala Ala Glu Gly Ala Glu Gly Pro Leu Ala Trp Val Val Ser Gly 
485 490 495 



25 



Arg Asp Glu Pro Ala Leu Arg Ser Gin Ala Arg Arg Leu Arg Asp His 
500 505 510 



30 



Leu Ser Arg Thr Pro Gly Ala Arg Pro Arg Asp lie Ala Phe Ser Leu 
515 520 525 

Ala Ala Thr Arg Ala Ala Phe Asp His Arg Ala Val Leu He Gly Ser 
530 535 540 



35 



40 



Asp Gly Ala Glu Leu Ala Ala Ala Leu Asp Ala Leu Ala Glu Gly Arg 
545 550 555 560 

Asp Gly Pro Ala Val Val Arg Gly Val Arg Asp Arg Asp Gly Arg Met 
565 570 575 

Ala Phe Leu Phe Thr Gly Gin Gly Ser Gin Arg Ala Gly Met Ala His 
580 585 590 

Asp Leu His Ala Ala His Thr Phe Phe Ala Ser Ala Leu Asp Glu Val 
595 600 605 



45 



Thr Asp Arg Leu Asp Pro Leu Leu Gly Arg Pro Leu Gly Ala Leu Leu 
610 615 620 



50 



Asp Ala Arg Pro Gly Ser Pro Glu Ala Ala Leu Leu Asp Arg Thr Glu 
«5 630 635 640 

Tyr Thr Gin Pro Ala Leu Phe Ala Val Glu Val Ala Leu His Arg Leu 
645 650 655 

Leu Glu His Trp Gly Met Arg Pro Asp Leu Leu Leu Gly His Ser Val 
660 665 670 



55 



Gly Glu Leu Ala Ala Ala His Val Ala Gly Val Leu Asp Leu Asp Asp 
675 680 685 
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Ala Cys Ala Leu Val Ala Ala Arg Gly Arg Leu Met Gin Arg Leu Pro 
690 695 700 

Pro Gly Gly Ala Met Val Ser Val Arg Ala Gly Glu Asp Glu Val Arg 
705 710 715 720 

Ala Leu Leu Ala Gly Arg Glu Asp Ala Val Cys Val Ala Ala Val Asn 
725 730 735 

Gly Pro Arg Ser Val Val He Ser Gly Ala Glu Glu Ala Val Ala Glu 
740 745 750 

Ala Ala Ala Gin Leu Ala Gly Arg Gly Arg Arg Thr Arg Arg Leu Arg 
755 760 765 

Val Ala His Ala Phe His Ser Pro Leu Met Asp Gly Met Leu Ala Gly 
770 775 780 

Phe Arg Glu Val Ala Ala Gly Leu Arg Tyr Arg Glu Pro Glu Leu Thr 
785 790 795 800 

Val Val Ser Thr Val Thr Gly Arg Pro Ala Arg Pro Gly Glu Leu Thr 
805 810 815 

Gly Pro Asp Tyr Trp Val Ala Gin Val Arg Glu Pro Val Arg Phe Ala 
820 825 830 

Asp Ala Val Arg Thr Ala His Arg Leu Gly Ala Arg Thr Phe Leu Glu 
835 840 845 

Thr Gly Pro Asp Gly Val Leu Cys Gly Met Ala Glu Glu Cys Leu Glu 
850 855 860 

Asp Asp Thr Val Ala Leu Leu Pro Ala He His Lys Pro Gly Thr Ala 
865 870 875 ~ 880 

Pro His Gly Pro Ala Ala Pro Gly Ala Leu Arg Ala Ala Ala Ala Ala 
885 890 895 

Tyr Gly Arg Gly Ala Arg Val Asp Trp Ala Gly Met His Ala Asp Gly 
900 90S 910 

Pro Glu Gly Pro Ala Arg Arg Val Glu Leu Pro Val His Ala Phe Arg 
915 920 925 

His Arg Arg Tyr Trp Leu Ala Pro Gly Arg Ala Ala Asp Thr Asp Asp 
930 935 940 

Trp Met Tyr Arg He Gly Trp Asp Arg Leu Pro Ala Val Thr Gly Gly 
945 950 955 960 

Ala Arg Thr Ala Gly Arg Trp Leu Val He His Pro Asp Ser Pro Arg 
965 970 975 

Cys Arg Glu Leu Ser Gly His Ala Glu Arg Ala Leu Arg Ala Ala Gly 
980 985 990 

Ala Ser Pro Val Pro Leu Pro Val Asp Ala Pro Ala Ala Asp Arg Ala 
995 1000 1005 



106 



EP0 791 655 A2 



Ser Phe Ala Ala Leu Leu Arg Ser Ala Thr Gly Pro Asp Thr Arg Gly 
1010 1015 1020 

Asp Thr Ala Ala Pro Val Ala Gly Val Leu Ser Leu Leu Ser Glu Glu 
1025 1030 1035 1040 

Asp Arg Pro His Arg Gin His Ala Pro Val Pro Ala Gly Val Leu Ala 
1045 1050 1055 

Thr Leu Ser Leu Met Gin Ala Met Glu Glu Glu Ala Val Glu Ala Arg 

1060 1065 1070 

Val Trp Cys Val Ser Arg Ala Ala Val Ala Ala Ala Asp Arg Glu Arg 
1075 1080 1085 

Pro Val Gly Ala Gly Ala Ala Leu Trp Gly Leu Gly Arg Val Ala Ala 
1090 1095 1100 

Leu Glu Arg Pro Thr Arg Trp Gly Gly Leu Val Asp Leu Pro Ala Ser 
20 1105 1110 1115 1120 

Pro Gly Ala Ala His Trp Ala Ala Ala Val Glu Arg Leu Ala Gly Pro 
1125 1130 1135 



10 



15 



25 



Glu Asp Gin lie Ala Val Arg Ala Ser Gly Ser Trp Gly Arg Arg Leu 
1140 1145 1150 

Thr Arg Leu Pro Arg Asp Gly Gly Gly Arg Thr Ala Ala Pro Ala Tyr 
1155 1160 1165 

30 Arg Pro Arg Gly Thr Val Leu Val Thr Gly Gly Thr Gly Ala Leu Gly 

1170 1175 1180 

Gly His Leu Ala Arg Trp Leu Ala Ala Ala Gly. Ala Glu His Leu Ala 
1185 1190 1195 1200 

Leu Thr Ser Arg Arg Gly Pro Asp Ala Pro Gly Ala Ala Gly Leu Glu 
1205 1210 1215 

Ala Glu Leu Leu Leu Leu Gly Ala Lys Val Thr Phe Ala Ala Cys Asp 
1220 1225 1230 

Thr Ala Asp Arg Asp Gly Leu Ala Arg Val Leu Arg Ala He Pro Glu 
1235 1240 1245 

Asp Thr Pro Leu Thr Ala Val Phe His Ala Ala Gly Val Pro Gin Val 
1250 1255 1260 

Thr Pro Leu Ser Arg Thr Ser Pro Glu His Phe Ala Asp Val Tyr Ala 
1265 1270 1275 1280 

Gly Lys Ala Ala Gly Ala Ala His Leu Asp Glu Leu Thr Arg Glu Leu 
1285 1290 1295 

Gly Ala Gly Leu Asp Ala Phe Val Leu Tyr Ser Ser Gly Ala Gly Val 
1300 1305 1310 

55 Trp Gly Ser Ala Gly Gin Gly Ala Tyr Ala Ala Ala Asn Ala Ala Leu 

1315 1320 1325 



40 



45 



50 
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Asp Ala Leu Ala Arg Arg Arg Ala Ala Asp Gly Leu Pro Ala Thr Ser 
1330 1335 1340 

He Ala Trp Gly Val Trp Gly Gly Gly Gly Met Gly Ala Asp Glu Ala 
1345 1350 1355 1360 

Gly Ala Glu Tyr Leu Gly Arg Arg Gly Met Arg Pro Met Ala Pro Val 
1365 1370 1375 

Ser Ala Leu Arg Ala Met Ala Thr Ala He Ala Ser Gly Glu Pro Cys 
1380 1385 1390 

Pro Thr Val Thr His Thr Asp Trp Glu Arg Phe Gly Glu Gly Phe Thr 
1395 1400 1405 

Ala Phe Arg Pro Ser Pro Leu He Ala Gly Leu Gly Thr Pro Gly Gly 
1410 1415 1420 

Gly Arg Ala Ala Glu Thr Pro Glu Glu Gly Asn Ala Thr Ala Ala Ala 
20 1^25 1430 1435 1440 

Asp Leu Thr Ala Leu Pro Pro Ala Glu Leu Arg Thr Ala Leu Arg Glu 
1445 1450 1455 



10 



15 



Leu Val Arg Ala Arg Thr Ala Ala Ala Leu Gly Leu Asp Asp Pro Ala 
1460 1465 1470 

Glu Val Ala Glu Gly Glu Arg Phe Pro Ala Met Gly Phe Asp Ser Leu 
1475 1480 1485 

Ala Thr Val Arg Leu Arg Arg Gly Leu Ala Ser Ala Thr Gly Leu Asp 
1490 1495 1500 

Leu Pro Pro Asp Leu Leu Phe Asp Arg Asp Thr Pro Ala Ala Leu Ala 
1505 1510 1515 1520 

Ala His Leu Ala Glu Leu Leu Ala Thr Ala Arg Asp His Gly Pro Gly 
1525 1530 * 1535 

Gly Pro Gly Thr Gly Ala Ala Pro Ala Asp Ala Gly Ser Gly Leu Pro 
1540 1545 1550 

Ala Leu Tyr Arg Glu Ala Val Arg Thr Gly Arg Ala Ala Glu Met Ala 
1555 1560 1565 

Glu Leu Leu Ala Ala Ala Ser Arg Phe Arg Pro Ala Phe Gly Thr Ala 
45 1570 1575 1580 

Asp Arg Gin Pro Val Ala Leu Val Pro Leu Ala Asp Gly Ala Glu Asd 
1585 1590 1595 1600 



25 



30 



35 



40 



50 



55 



Thr Gly Leu Pro Leu Leu Val Gly Cys Ala Gly Thr Ala Val Ala Ser 
1605 1610 1615 

Gly Pro Val Glu Phe Thr Ala Phe Ala Gly Ala Leu Ala Asp Leu Pro 
1620 1625 1630 

Ala Ala Ala Pro Met Ala Ala Leu Pro Gin Pro Gly Phe Leu Pro Gly 
1635 1640 1645 
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Glu Arg Val Pro Ala Thr Pro Glu Ala Leu Phe Glu Ala Gin Ala Glu 
1650 1655 1660 

Ala Leu Leu Arg Tyr Ala Ala Gly Arg Pro Phe Val Leu Leu Gly His 
1665 1670 1675 1680 

Ser Ala Gly Ala Asn Met Ala His Ala Leu Thr Arg His Leu Glu Ala 
1685 1690 1695 

Asn Gly Gly Gly Pro Ala Gly Leu Val Leu Met Asp lie Tyr Thr Pro 
1700 1705 1710 

Ala Asp Pro Gly Ala Met Gly Val Trp Arg Asn Asp Met Phe Gin Trp 
1715 1720 1725 

Val Trp Arg Arg Ser Asp He Pro Pro Asp Asp His Arg Leu Thr Ala 
1730 1735 1740 

Met Gly Ala Tyr His Arg Leu Leu Leu Asp Trp Ser Pro Thr Pro Val 
1745 1750 1755 1760 

Arg Ala Pro Val Leu His Leu Arg Ala Ala Glu Pro Met Gly Asp Trp 
1765 1770 1775 

Pro Pro Gly Asp Thr Gly Trp Gin Ser His Trp Asp Gly Ala His Thr 
1780 1785 1790 

Thr Ala Gly lie Pro Gly Asn His Phe Thr Met Met Thr Glu His Ala 
1795 1800 1805 

Ser Ala Ala Ala Arg Leu Val His Gly Trp Leu Ala Glu Arg Thr Pro 
1810 1815 1820 

Ser Gly Gin Gly Gly Ser Pro Ser Arg Ala Ala Gly Arg Glu Glu Arg 
1825 1830 1835 1840 

Pro * 



(2) INFORMATION FOR SEQ ID NO:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44377 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 350.. 14002 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 14046.. 20036 

(ix) FEATURE: 

(A) NAME/KEY: CDS 
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10 



is 



20 



25 



35 



40 



SO 



55 



(B) LOCATION: 20110.. 31284 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 31329.. 36071 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 36155,. 41830 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

GACCGCTCGG GGAGACCTGA CATATTCGTC GCGAAGTCGT TCTCCGCGCC GCGAGGTACT 60 

GAAATCTTCT CCGCTCGCCC AGGACTCCGC GTGCAGGTCA CCGGAGTCCG CGACCGGCCG 120 

GGACGTCGGA GCGCCGACCC TGCGGACCTG GTCCGATCCC GTGTGGTCCC GCATGATCCC 180 

GCGCCGTCTC CGGTGACGAG AATCGGTGGA CAATCTCCGA ACTTGACACA ATTGATTGTC 240 

GTTCACCGGC CGTTCCTCTC GCCCGCCAGT TCGCCCGCTG TACGCTCGGG AAGATCAAGA 300 

AAAGGCAGAA AAGCCACGGC GTGGTACGGC GAACATATGA GGGATGCAGG TGTCTGGAGA 360 

ACTCGCGATT TCCCGCAGTG ACGACCGGTC CGACGCCGTT GCCGTGGTCG GAATGGCGTG 420 

CCGGTTTCCC GGCGCCCCGG GAATTGCCGA ATTCTCGAAA CTGCTGACCG ACGGAAGGGA 480 

CGCGATCGGC CGGGACGCCG ACGGCCGCCG GCGCGGCATG ATCGAGGCGC CCGGCGACTT 540 

30 CGACGCCGCC TTCTTCGGCA TGTCACCCCG CGAGGCCGCC GAGACCGACC CCCAGCAGCG 600 

CCTGATGCTC GAACTCGGCT GGGAGGCTCT GGAGGACGCC GGCATCGTCC CCGGCTCCCT 660 

GCGCGGCGAG GCGGTCGGCG TCTTCGTCGG GGCCATGCAC GACGACTACG CCACCCTCCT 720 

CCACCGCGCC GGCGCGCCGG TCGGCCCCCA CACCGCCACC GGCCTCCAGC GCGCCATGCT 780 

CGCCAACCGG CTCTCCTACG TCCTGGGGAC GCGCGGCCCC AGCCTCGCGG TCGACACCGC 840 

CCAGTCGTCC TCCCTGGTCG CCGTGGCCCT CGCCGTCGAG AGCCTGCGGG CCGGCACCTC 900 

CCGCGTCGCC GTCGCCGGGG GCGTCAACCT GGTCCTCGCC GACGAGGGAA CGGCCGCCAT 960 

GGAACGCCTC GGCGCGCTGT CACCCGACGG CCGCTCCCAC ACCTTCGACG CCCGTGCCAA 1020 

45 CGGCTATCTC CGCGGTGAGG GCGGCGCCGC OGTCGTCCTG AAGCCCCTCG CCGACGCCCT 1080 

GGCCGACGGG GACCCCGTGT ACTGCGTGGT GCGTOGCGTC GCCGTCGGCA ACGACGGCGG 1140 

CGGCCCCGGG CTGACCGCTC CCGACCGCGA GGGACAGGAG GCGGTGCTCC GGGCCGCCTC 1200 

CGCCCAGGCC CGGGTCGACC CCGCCGAGGT GCGTTTCGTC GAACTGCACG GCACGGGAAC 1260 

CCCGGTGGGC GACCCGGTCG AGGCACACGC CCTCGGCGCG GTGCACGGCT CCGGTCGCCC 1320 

GGCCGACGAC CCCCTGCTGG TGGGGTCGGT GAAGACCAAC ATCGGCCACC TGGAGGGCGC 1380 

CGCCGGCATC GCGGGCCTGG TCAAGGCCGC ACTCTGCCTG CGGGAACGCA CCCTTCCCGG 1440 
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CTCGCTGAAC TTCGCCACCC CCTCTCCGGC CATCCCGCTG GACCAGCTCC GGCTGAAGGT 1500 

GCAGACCGCT GCCGCCGAGC TGCCGCTCGC CCCGGGCGGC GCACCCCTGC TCGCGGGTGT 1560 

5 

CAGTTCGTTC GGCATCGGTC GCACCAACTG CCATGTQGTC CTGGAACACC TGCCCTCCCG 1620 

GCCCACCCCG GCCGTCTCCG TCGCCGCCTC GCTTCCGGAC GTCCCGCCGC TGTTGTTCTC 1680 

to CGCGCGGTCG GAGGGGGCGT TGCGGGCGCA GGCGGTGCGG TTGGGTGAGT ACGTGGAGCG 1740 

GGTGGGCGCG GATCCGCGGG ATGTGGCTTA TTCGCTGGCT TCGACGCGGA CTCTTTTCGA 1800 

GCACCGTGCG GTGGTGCCGT GTGGTCGGCG TGGGGAGCTC GTCGCTGCTC TTGGTGGGTT 1860 

15 TGCTGCCGGG AGGGTGTCTG GGGGTGTGCG GTCCGGGCGG GCTGTGCCGG GTGGGGTGGG 1920 

GGTGTTGTTC ACGGGTCAGG GTGCGCAGTG GGTTCGTATG GGGCGTGGGT TGTATGCGGG 1980 

GGGTGGGGTG TTTGCGGAGG TGCTGGATGA GGTGTTGTCG ATGGTGGGGG AGGTGGATGG 2040 

20 

TCGGTCGTTG CGGGATGTGA TGTTCGGCGA CGTCGACGTC GACGCGGGTG CCGGGGCTCA 2100 

TGCGGGTGCC GGTGCGGGTG CTGGGGTCGG TTCTGGTTCC GGTTCTGTGG GTGGGTTGTT 2160 

GGGTCGGACG GAGTCTCCTC AGCCTGCGTT GTTTGCGTTG GAGGTGGCGT TGTTCCGGGC 2220 

25 

GTTGGAGGCT CGGGGTGTGG AGGTGTCGGT GGTGTTGGGT CATTCGGTGG GGGAGGTGGC 2280 

TGCTGCGTAT GTGGCGGGGG TGTTGTCGTT GGGTCATGCG GTGCGGTTGG TGGTGGCGCG 2340 

30 GGGTGGGTTG ATGGGTCGGT TGCCGGTGGG TGGGGGGATG TGGTCGGTGG GGGCGTCGGA 2400 

GTCGGTGGTG CGGGGGGTTG TTGAGGGGTT GGGGGAGTGG GTGTCGGTTG CGGCGGTCAA 2460 

TGGGCCGCGG TCGGTGGTGT TCTCGGGTCA TGTGGGTGTG CTGGAGTCGG TGGTTGCCTC 2520 

35 GCTGATGGGG GATGGGGTGG AGTGCCGGCG GTTGGATGTG TCGCATGGGT TTCATTCGGT 2580 

GTTGATGGAG CCGGTGTTGG GGGAGTTCCG GGGGGTTGTG GAGTCGTTGG AGTTCGGTCG 2640 

GGTGCGGCCG GGTGTGGTGG TCGTGTCGGG TGTGTCGGGT GGGGTGGTGG GTTCGGGGGA 2700 

40 

GTTGGGGGAT CCGGGGTATT GGGTGCGTCA TGCGCGGGAG GCGGTGCGTT TCGCGGATGG 2760 

GGTGGGGGTC GTGCGTGGTC TGGGTGTGGG GACGTTGGTG GAGGTGGGTC CGCATGGGGT 2820 

45 GCTGACGGGG ATGGCGGGTG AGTGCCTGGG GGCCGGTGAT GATCTGGTGG TGGTCCCGGC 2880 

GATGCGGCGG GGCCGTGCGG AGCGGGAGGT GTTCGAGGCG GCGCTGGCGA CGGTGTTCAC 2940 

CCGGGACGCC GGCCTGGACG CCACGGCACT CCACACCGGG AGCACCGGCC GGCGCATCGA 3000 

SO CCTCCCCACC TACCCCTTCC AACGCCGTAC CCACTCGTCG CCCGCGCTGA GCCGGCCGGT 3060 

CACGGCCGAC CCCGGGGCCG GTGTGACCGC CACCGATGCC GTGGGGCACA GCGTCTCCCC 3120 

GGACCCGGAG AGCACCGAGG GGACGTCCCA CAGGGACACG GACGACGAGG CGGACTCGGC 3180 

ss 

GTCACCGGAG CCGATGTCCC CCGAGGATCC CGTCCGCCTG GTCCGCGAGA GCACCGCGGC 3240 
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15 



20 



25 



CGTCCTGGGC CACGACGATC CCGGCGAGGT CGCGCTCGAC CGCACCTTCA CCTCCCAGGG 3300 

^ CATGGACTCG GTGACCGCGG TCCAGCTCTG CGACCTGCTG AAGGGCGCCT CGGGCCTCCC 3360 

CCTCGCCGCC ACGCTSGTCT ACGACCTGCC CACCCCGCGT GCCGTCGCCG AGCACATCGT 3420 

GG AAGCCGCG GGCGGGCCGA AGGACTCGGT TGCCGGTGGG CCCGGAGTGC TCTCGTCGGC 3480 

70 CGCGGTAGGG GTGTCGGACG CCCGGGGCGG CAGCCGGGAC GACGACGACC CGATCGCCAT 3540 

CGTOGGTCTC GGCTGCCGGC TCCCCGGCGG CGTCGACTCG CGCGCCGCTC TCTGGGAGCT 3600 

GCTGGAGTCC GGCGCCGACG CCATCTCGTC CTTCCCCACC GACCGCGGCT GGGACCTCGA 3660 

CGGGCTGTAC GACCCCGAGC CCGGGACGCC CGGCAAGACC TATGTGCGGG AGGGCGGGTT 3720 

CCTGCACTCG GCGGCCGAGT TCGACGCGGA GTTCTTCGGG ATATCGCCGC GCGAGGCCAC 3780 

GGCCATGGAC CCGCAGCAGC GCTTGCTGCT GGAAGCGTCG TGGGAGGCCC TCGAGGACGC 3840 

CGGAGTCCTC CCCGAGTCAC TGCGCGGCGG CGACGCCGGA GTGTTCGTCG GCGCCACCGC 3900 

ACCGGAGTAC GGGCCGAGGC TTCACGAGGG AGCGGACGGA TACGAGGGGT ACCTGCTCAC 3960" 

CGGCACCACC GCGAGCGTGG CCTCCGGCCG GATCGCCTAC ACCCTCGGCA CCGGCGGACC 4020 

GGCGCTCACC GTCGACACCG CGTGCTCCTC GTCCCTGGTG GCGCTGCACC TCGCCGTCCA 4080 

GGCGCTGCGC CGGGGCGAGT GCGGGCTGGC TCTGGCGGGC GGCGCCACGG TGATGTCGGG 4140 

30 GCCCGGCATC TTCGTGGAGT TCTCGCGGCA GCGCGGGCTC GCCCCCGACG GCCGCTGCAT 4200 

GCCGTTCTCC GCCGATGCCG ACGGTACGGC CTGGTCCGAG GGTGTCGCCG TACTGGCACT 4260 

GGAGCGGCTC TCCGACGCCC GGCGTGCGGG ACACCGGGTG CTGGGCGTGG TCCGGGGCAG 4320 

TGCGGTCAAC CAGGACGGTC CCAGCAACGG CCTGACCGCT CCCAACCGCT CCGCGCAGGA 4380 

GGGCGTCATC CGAGCTGCCC TGGCCGACGC CGGCCTCGCG CCGGGTCACG TCGACGCGGT 4440 

GGAGGCGCAC GGTACGGGGA CGGCGCTGGG CGATCCGATC GkGGCGAGCG CGCTGCTGGC 4500 

CACGTACGGG CGTGAGCGGG TGGGCGACCC CTTGTGGCTC GGGTCGCTGA AGTCCAACGT 4560 

CGGTCACACC CAGGCCGCCG CGGGGGCCGC GGGTGTGGTC AAGATGCTGC TTGCCCTGGA 4620 

GCACGGCACG CTGCCGCGGA CACTTCACGC GGACCGGCCC AGCACGCACG TCGACTCGTC 4680. 

GTCGGGCACC GTCGCCCTCC TGGCAGAGGC GCGCCGGTGG CCCCGGCGGT CGGACCGCCC 4740 

GCGCCGGGCG GCTGTGTCGT CGTTCGGGAT CAGTGGGACG AACGCGCATC TGATCATCGA 4800 

50 GGAGGCGCCX5 GAGTGGGTCG AGGACATCGA CGGCGTCGCT GCTCCTGACC GCGGTACCGC 4860 

GGACGCGGCT GCTCCGTCGC CGCTCTTGTT GTCCGCGCGG TCGGAGGGGG CGTTCCGGGC 4920 

GCAGGCGGTG CGGTTGGGTG AGTACGTGGA GCGGGTGGGT GCGGATCCGC GGGATGTCGC 4980 

TTATTCGCTG GCTTCGACGC GGACTCTTTT CGAGCACCGT GCGGTCGTGC CGTCTCGTCG 5040 
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GCGTCGGGAG CTCGTCGCTC CTCTTGGTGG GTTTGCTCCC GGGAGGGTGT CTGGGGGTGT 5100 
GCOGTCCGGG CGGGC^GTGC CGGGTGGGGT GGGGGTGTTG TTCACGGGTC AGCGTGCGCA 5160 
GTGGGTTGGT ATGGGGCGTG GGTTGTATGC GGGGGGTGGG GTCTTTGCGG AGGTCCTCGA 5220' 
TGAGGTGTTG TCGATGGTGG GGGAGGTOGA TCGTCGGTCG TTGCGGGATC TGATCTTCGG 5280 
CGACGTCGAC GTGGACGCGG GTGCCGGGGC TGATGCGGGT GCCGGTCCGG GTCCTGGGGT 5340 
CGGTTCTGGT TCCGGTTCTC TGGGTGGCTT GTTCGGTCGG ACGGAGTTTC CTCAGCCICC 5400 
GCTGTTTGCG TTGGAGGTGG CGTTGTTCCG GGCGTTGGAG GCTCGGGGTG TCGAGGTGTC 5460 
GGTGGTGTTG GGTCATTCGG TGGGGGAGGT GGCTGCTGCG TATGTCGCGG GGGTCTTGTC 5520 
GTTGGGTGAT GCGGTGCGGT TGGTGGTGGC GCGGGGTGGG TTGATGGGTG GGTTGCCGGT 5580 
GGGTGGGGGG ATGTGGTCGG TGGGGGCGTC GGAGTCGGTG GTGCGGGGGG TTGTTGAGGG 5640 

GTTGGGGGAG TGGGTGTCGG TTGCGGCGGT GAATGGGCCG CGGTCGGTCG TCTTCTCGGG 5700 

TCATGTGGGT GTGCTGGAGT CGGTGGTTGC CTCGCTGATG GGGGATGGGG TGGAGTGCCG 5760 

GCGGTTGGAT GTGTCGCATG GGTTTCATTC GGTGTTGATG GAGCCGGTGT TGGGGGAGTT 5820 

CCGGGGGGTT GTGGAGTCGT TGGAGTTCGG TCGGGTGCGG CCGGGTCTGG TGGTGGTGTC 5880 

GGGTGTGTCG GGTGGGGTGG TGGGTTCGGG GGAGTTGGGG GATCCGGGGT ATTGGGTGCG 5940- 

TCATGCGCGG GAGGCGGTGC GTTTCGCGGA TGGGGTGGGG GTGGTGCGTG GTCTCGGTGT 6000 

GGGGACGTPG GTGGAGGTGG GTCCGCATGG GGTGCTGACG GGGATGGCGG GTGAGTGCCT 6060 

GGGGGCCGGT GATGATGTGG TGGTGGTGCC GGCGATGCGG CGGGGCCGTG CGGAGCGGGA 6120 

GGTGTTCGAG GCGGCGCTGG CGACGGTGTT CACCCGGGAC GCCGGCCTGG ACGCCACGGC 6180 

ACTCCACACC GGGAGCACCG GCCGGCGCAT CGACCTCCCC ACCTACCCCT TCCAACGCGA 6240 

CCGCTACTGG CTGGACCCCG TTCGCACCGC CGTCACCGGC GTCGAGCCCG CCGGCTCGCC 6300 

GGCGGACGCT CGGGCCACTG AGCGGGGACG GTCGACGACG GCCGGGATCC GCTACCGCGT 6360 

CGCTTGGCAG CCGGCCGTCG TCGACCGCGG CAACCCCGGG CCTGCCGGTC ATCTGCTGCT 6420 

TCTGGCCCCG GACGAGGACA CGGCCGACTC CGGACTCGCC CCCGCGATCG CACGTGAACT 6480 

CGCCGTGCGC GGGGCCGAGG TCCACACCGT CGCCGTGCCG GTCGGTACAG GCCGGGAGGC 6540 

AGCCGGGGAC CTCTTGCGGG CCGCCGGTGA CGGTGCCGCC CGCAGCACCC GAG1TCTCTG 6600 

GCTCGCCCCG GCCGAGCCGG ACGCGGCCGA CGCCGTCGCC CTCGTCCAGG CGCTCGGCGA 6660 

GCCGGTACCC GAAGCCCCGC TCTGGATCAC CACCCGTGAG GCGGCGGCCG TCCGGCCGGA 6720 

CGAGACCCCT TCCGTCGGGG GCGCTCAGCT GTGGGGACTC GGACAGGTCG CCGCGCTCGA 6780 

ACTGGGGCGG CGCTGGGGCG GCTTGGCGGA CCTCCCCGGG AGTGCGTCGC CCGCGGTGCT 6840 
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CCAGCAOGAC TACGCGACCC AGCTGGGGGA 
CACGGCCACC CTCOCCAGTG TGATCTCCGG 

5 

GCCGGCGCTC ACGGTGGACA CGGCGTGTTC 
GCAGTCGTTG CGGCGGGGTG AGTGTGATCT 

10 GACGCCGACG GTGTTCGTGG ACTTCTCGCG 

CAAGGCGTTC GCGGACGCTG CGGACGCGAC 
GGTGGAGCGG CTTTCCGACG CGCGCCGCAA 

15 CAGTGCGGTC AATCAGGACG GTGCGAGCAA 

GCAGCGGGTG ATCCGTGAGG CGCTGGCTGA 
GGTGGAGGCG CACGGTACGG GGACGGCGCT 

20 

GGCCACGTAC GGGCGGGAGC GGGTCGGCGA 
CATCGGGCAT GCGCAGGCGG CTGCGGGTGT 
GCGGCATGGG TCGTTGCCGC GGACGCTGCA 

25 

GGCTTCGGGT GCGGTGGAGC TGCTGACCGA 
GGTGCGGCGG GCCGCGGTGT CGGCGTTCGG 

30 GGAGGAAGCG CCGGCGGAGG CCGGGAGCGA 

CGACGCGGTG ACGGGTCCGT TGTCGTGGGT 
GGCGCAGGCG GTGCGGTTGC GTGAGTGTGT 

35 GGCGGGGTCG TTGGTGCTGT CGCCTGCGTC 

GGGGCGTGAG GAGTTGCTGG CGGGTCTGGA 
GTCTTCGGGG GCCGGTGCTG TGGTGCGGGG 

40 

GTTGTTCACG GGTCAGGGTG CGCAGTGGGT 
TGGGGTGTTT GCGGAGGTCC TGGATGAGGT 
GTCGTTGCGG GATGTGATGT TCGCGGATGC 

45 

GACGGAGTTT GCTCAGCCTG CGTTCTTTGC 
GGCTCGGGGT GTGGAGGTGT CGGTGGTGTT 
SO GTATGTGGCG GGGGTGTTGT CGTTGGGTGA 

GTTGATGGGT GGGTTGCCGG TGGGTGGGGG 
GGTGCGGGGG GTTGTTGAGG GGTTGGGGGA 

55 

GCGGTCGGTG GTGTTGTCGG GTGATGTGGG 



CGCCGCCGAC ACCTACGGCG GGCATCTGCT 8700 

TCGGGTTGCC TATGCGTTGG GGTTOCAGGC 6760 

GTCGTCGTTG GTGGCGTTGC ATCTGGCGGT 8820 

GGCGTTGGCC GGTGGGGTGA CGGTGATGGC 8880 

GCAGCGGGGG CTGGCGGCGG ACGGGCGGTG 8940 

GGCGTVGGGCG GAGGGTGTGG GTGTCCTCCT 9000 

CGGTCATCGG GTGCTGGCGG TGGTGCGGGG 9060 

TGGGCTGACG GCGCCGAGTG GTCCGGCGCA 9120 

TGCGGGGCTG GTGCCCGCCG ACGTQGATGT 9180 

GGGTGATCCG ATCGAGGCGG GTGCGCTGCT 9240 

TCCGTTGTGG CTCGGGTCGT TGAAGTCGAA 9300 

GGGTGGTGTG ATCAAGGTGG TGCAGGGGAT 9360 

TGTGGATGCG CCGTCGTCGA AGGTGGAGTG 9420 

GACCCGGTCG TGGCCGCGGC GGGTGGAGCG 9480 

GGTGAGCGGG ACCAACGCCC ATCTGGTCCT 9540 

GCACGGGGAC GGCCCTGAAC CTGAGCGGCC 9600 

GCTTTCTGCG CGGTCGGAGG GGGCGTTGCG 9660 

GGAGCGGGTG GGTGCGGATC CGCGGGATGT 9720 

GTTCGGTGAG CGTGCGGTGG TGGTGGGCCG 9780 

TGTGGTGGCT GCCGGGGCTC CTGTGGGTGT 9840 

GAGTGCGGTG CGGGGTCGTG GGGTGGGGGT 9900 

TGGTATGGGG CGTGGGTTGT ATGCGGGGGG 9960" 

GTTGTCGGTG GTGGGGGAGG TGGATGGTCG 10020 

TGACTCGGTT TTGGGTGGGT TGTTGGGTCG 10080 

GTTGGAGGTG GCGTTGTTCC GGGCGTTQGA 10140 

GGGTCATTCG GTGGGGGAGG TGGCTGCTGC 10200 

TGCGGTGCGG TTGGTGGTGG CGCGGGGTGG 10260 

GATGTGGTCG GTGGGGGCGT CGGAGTCGGT 10320 

GTGGGTGTCG GTTGCGGCGG TGAATGGGCC 10380 

TGTGCTGGAG TCGGTGGTTG TCACGCTGAT 10440 
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GGGGGATGGG GTCGAGTGCC GGCGGTTCGA 
GGACCCGGTG TTGGGGGAGT TCCGGGGGGT 

5 

GCCGGGTCTC GTGGTGGTCT CGGGTGTGTC 
GGATCCGGGG TATTGGGTGC GTCATGCGCG 

10 GGTGGTGCGT GGTCTGGGTG TGGGGACGTT 

GGGGATGGCG GGTCAGTGCC TGGAGGCCGG 
GCGGGGCCGT CCGGAGCGGG AGGTGTTCGA 

15 CGCCGGCCTC GACGCCACGA CACTCCACAC 

CACCTACCCC TTCCAACACA ACCGCTACTG 
CACCTCGGCA GCCGCGCGCT TCGGCCTGGA 

20 

CACGCCGATA GCCGGCTCCG GCGCGCTGCT 
CCCGTGGCTG GCCGACCACG CCATCTCCGG 
CGACCTGCTG CTGCGGGCGG TCGAGGAGGT 

25 

CCATGAGCCC CTGCTCCTCC CCGAGCGAGG 
GGCCGACGAG CAGCGACGGC GTGCCGTGGC 
30 GGACGGTGAG GAACAGGAGT GGACCCGGCA 

GGCCGTTCCG GACATGGGCT GGGCCGCCGG 
CGACGTCGAG GAGCTGTACG ACGCGTTCGC 

35 

CACCGCACTG TCCGGCGTGT GGCGTCTCGG 
CGCGGGGQGC GCGGGCACGA CCGGTGACGG 
GGCCCTCCAC CCGTGGCGOG CCGGCGGGCT 

40 

GCCGTTCTCC TGGCAGGGCA TCGCGCTCCA 
ACTGGCCCCT GCGGCCGGCG GCACCGAGTC 

45 GGGCACCCCG GTCCTCACCC TCGACGCACT 

CGACGCGCCG CAACCGCTCT ACCGCGTCGA 
CTCCGGCGCC CAGGGCTGGA CGGTGCTCGG 

50 CGCCGCCCAT GCGGACCTCA CCGCCCTCCG 

GCCCCGGCTG GTGGTCGTGT CGCCGGTGGA 
GGACGCCGAG GCTCGGGCCC GTGCGGGTGA 

55 

CCTCGGGCGC GGCCTCACCC TGCTCCGGGA 



TGTGTCGCAT GGGTTTCATT CGGTGTTGAT 10500 

TGTGGAGTCG TTGGAGTTCG GTCGGGTGCG 10560 

GGGTGGGGTG GTGGGTTCGG GCGACTTGGG 10620 

GGAGGCGGTG CGTTTCGOGG ATGGGGTCGG 10680 

GGTGGAGGTG GGTCCGCATG GGGTGCTGAC 10740 

TCATGATGTC GTGGTGGTGC CGGCGATGCG 10800 

GGCGGCGCTG GCGACGGTGT TCACCCGGGA 10860 

CGGGAGCACC GGCCGACGCA TCGACCTCCC 10920 

GGCAACCGGC TCAGTGACCG GTGCGACCGG 10980 

GTGGAAGGAC CACCCCTTCC TCAGCGGCGC 11040 

CCTCACCGGC AGGGTGGGGC TCGCTGCCCA 11100 

CACGGTGCTG CTCCCCGGAA CGGCGATCGC 11160 

CGGCGCCGGA GGGGTCGAGG AACTGACGCT 11220 

CGGCCTGCAC GTCCAGGTGC TGGTCGAGGC 11280 

AGTCGCCGCA CGCCCGGAGG GCCCTCGGCG 11340 

CGCGGAAGGC GTGCTCACCT CCACCGAGAC 11400 

GGCCTGGCCG CCGCCCGGTG CCGAGCCGAT 11460 

CGCGGACGGC TACGGCTACG GCCCGGCCTT 11520 

CGACGAACTC TTCGCCGAGG TCCGGCGGCC 11580 

TTTCGGCGTC CACCCCGCAC TCTTCGATGC 11640 

GCTGCCCGAC ACGGGCGGCA CCACCTGGGC 11700 

CACCACCGGA GCCGAGACGC TCCGCGTCAG 11760 

GGCCTTCTCC GTACAGGCCG CCGACCCGGC 11820 

GCTGCTCCGC CCGGTGACCC TGGGGAGGGC 11880 

CTGGCAGCCG GTCGGCCAGG GGACCGAGGC 11940 

GCAGGCCGCG GCCGAGACGG TCGCGCAGCC 12000 

TACGGCTGTG GCCGCGGCGG GAACACCCGT 12060 

CACCCGGCTG GACGAGGGGC CGGTGCTGGC 12120 

CGGCTGGGAC GACGATCCCC TACGTGTCGC 12180 

GTGGGTCGAG GACGAACGGT TGGCGGACTC 12240 
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CCGGCTCGTC GTCCTCACCC GTOGCOCGGT GGCCGCCGGT CCCGGCGATC TCCCGGACCT 12300 

GACAGGTCCG GCCCTCTCOG GGCTGCTCCG CTCCGCGCAG TCGGAGTATC CGCACCGCTT 12360 

5 

CACCCTCATC GACGTGGACG ATTCCCCCGA GTCCCGTGCG GCTCTGCCCC GGGCTCTGGG 12420 

ATCGGCCGAG CGACAACTCG CCCTGCGGAC GGGCGACGTG CTGGCGCCGG CCCTCGTCCC 12480 

to GATGGCCACC CGGCCGGCGG AGACCACTCC AGCGACGGCG GTCGCCTCGG CGACAACACA 12540 

GACACAGGTC ACCGCGCCCG CTCCCGACGA CCCGGCTGCG GATGCCGTGT TCGACCCGGC 12600 

GGGCACCGTA CTGATCACCG GCGGCACCGG CGCCCTGGGA CGGCGTGTCG CCTCGCACCT 12660 

,5 CGCGCGCCGG TACGGCGTAC GCCACATGCT TCTGGTCAGC AGGCGTGGAC CGGACGCCCC 12720 

CGAGGCCGGT CCCCTGGAAC GGGAACTCGC CGGTCTCGGA GTCACCGCCA CCTTCCTGGC 12780 

ATGCGACCTC ACCGACATCG AGGCCGTACG GAAGGCCGTC GCCGCGGTGC CGTCGGACCA 12840 

20 

CCCGCTGACC GGTGTGGTGC ACACCGCCGG CGTGCTGGAC GACGGCGCCC TGACCGGCCT 12900 

GACCCGGCAA CGCCTCGACA CCGTGCTGCG GCCCAAGGCC GACGCCGTGC GGAACCTCCA 12960 

25 CGAGGCGACC CTCGACCGGC CGCTGCGCGC GTTCGTCCTG TTCTCCGCCG CCGCCGGACT 13020 

CCTGGGCCGC CCCGGGCAGG CCTCCTACGC CGCCGCCAAC GCGGTCCTCG ACGCGCTCGC 13080 

GGGAGCCCGC CGCGCGGCCG GACTGCCCGC AGTGTCCCTG GCGTCGGGCC TGTGGGACGA 13140 

30 GCAGACGGGC ATGGCAGGAG GCCTCGACGA GATGGCCCTC CGCGTGCTGC GCCGGGACGG 13200 

CATCGCCGCG ATGCCTCCGG AGCAGGGGCT CGAACTGCTC GACCTGGCCC TGACCGGACA 13260 

CCGGGACGGA CCCGCCGTCC TCGTCCCCCT CCTCCTCGAC GGCGCGGCCC TGCGCCGCAC 13320 

GGCGAAGGAG CGCGGCGCGG CCACGATGTC CCCCTTGCTC CGCGCCCTGC TGCCCGCCGC 13380 

CCTGCGCCGC AGCGGTGGAG CCGGCGCCCC CGCGGCGGCC GACCGGCACG GCAAGGAGGC 13440 

GGACCCCGGT GCGGGACGCC TCGCAGGGAT GGTGGCACTC GAAGCGGCGG AGCGTTCCGC 13500 

40 

GGCCGTCCTT GAGCTGGTCA CCGAACAGGT CGCCGAGGTC CTCGGCTACG CGTCGGCCGC 13560 

GGAGATCGAG CCCGAACGAC CCTTCCGGGA GATCGGCGTC GACTCCCTGG CGGCGGTGGA 13620 

45 GCTGCGCAAC CGGCTCAGCC GTCTGGTCGG CCTCCGGTTG CCGACCACGC TGTCCTTCGA 13680 

CCACCCCACG CCGAAGGACA TGGCGCAGCA CATCGACGGG CAGCTCCCCC GCCCGGCCGG 13740 

AGCCTCGCCC GCGGACGCAG CGCTGGAAGG GATCGGCGAC CTCGCGCGGG CGGTCGCCCT 13800 

^0 GCTGGGCACG GGCGACGCCC GCCGGGCCGA GGTACGAGAG CAGCTCGTCG GACTGCTGGC 13860 

CGCGCTCGAC CCACCTGGGC GGACGGGCAC CXXTCGCACCC GGCGTCCCCT CCGGTGCCGA 13920 

TQGCGCGGAA CCGACCGTCA CGGACCGGCT CGACGAGGCG ACCGACGACG AGATCTTCGC 13980 

55 

CTTCCTGGAC GAGCAGCTGT GACCACACCG TGGACCGACC GCATCCCGAG GAGTTCGTCG 14040 
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CAGCAATGAC CGCCGAGAAC GACAAGATCC 
TCCACCGGAC CAAGTCCCGC CTGGCCGAGG 

5 

TCCTGGGCAT GGCGTGCCGT TACCCGGGCG 
TGCTGGCAGC CGGTACGGAC GCOGTCTCCG 

w AGGGGCTGTA CGACCCCGAT CCGGAGGCGG 

TCCTGCACTC GGCGGCCGAG TTCGACGCGG 
CGGCGATGGA TCCGCAGCAG CGGTTCCTGC 

IS CGGGGATCGT CCCCGCGTCG CTGCGCGGCA 

ACGACGACTA CGGGTCGCGG TTCGACTCGG 
ACGGCAGCGC CGGCAGCATC GCGTCCGGTC 

20 CGGCGCTGAC GGTGGACACG GCGTGTTCGT 

AGTCGTTGCG GCGGGGTGAG TGTGATCTGG 
CGCCGACGGT GCTCGTGGAG TTCTCGCGGC 

25 

AGGCGTTCGC GGAGGGTGCG GACGGGACGG 
TGGAGCGGCT CTCCGACGCC CGCCGCAATG 

30 GTGCGGTCAA TCAGGACGGT GCGAGCAACG 

AGCGGGTGAT CCGTGAGGCG CTGGCCGACG 
TCGAGGCGCA CGGCACCGGC ACACCCCTGG 

35 CCACCTATGG CAGTGAGCGC CAGGGCCAAG 

ACATCGGGCA TGCGCAGGCG GCTGCGGGTG 
TGCGGCATGG GTCGTTGCCG CGGACGCTGC 

40 

GGGCTTCGGG TGCGGTGCAG CTGCTCACCG 
TGGAGGAAGC GCCGGCGGAG GCCGGGAGCG 

45 

CCGACGCGGT GACGGGTCCG TTGTCGTGGG 
GGGCGCAGGC GGTGCGGTTG CGTGAGTGTG 

SO TQGCGGGGTC GTTGGTCGTG TCGCGTGCGT 

GGGGGCGTGA GGAGTTGCTG GCGGGTCTGG 
TCTCCGGGGG CGTGTCTTCG GGGGCCGGTG 

bS GTGGGGTGGG GGTGTTGTTC ACGGGTCAGG 



GCAGCTACCT GAAGCGTGCC ACCGCCGAAC 14100 

TCGAGTCGGC GAGCCGCGAG CCGATCGCGA 14160 

GTGTCGCGTC GCCGGACGAC CTGTGGGACC 14220 

CGTTCCCCGT CGACCGTGGC TGGGACGTCG 14280 

TGGGGCGTAG TTACGTCCGC CAGGGCGOCT 14340 

AGTTCTTCGG GATCTCGCCC CGTGAGGCGG 14400 

TGGAGACGTC GTGGGAGGCG CTGGAGCGGG 14460 

CCCGTACCGG CGTCTTCACC. GGCGTCATGT 14520 

CTCCGCCGGA GTACGAGGGC TACCTCGTCA 14580 

GGGTTGCCTA TGCGTTGGGG TTGGAGGGGC 14640 

CGTCGTTGGT GGCGTTGCAT CTGGCGGTGC 14700 

CGTTGGCCGG TGGGGTGACG GTGATGGCGA 14760 

AGCGGGGGCT GGCGGCGGAC GGGCGGTGCA 14820 

CGTGGGCCGA GGGTGTGGGC GTGCTGCTGG 14880 

GCCATCGGGT GCTGGCGGTG GTCCGGGGCA 14940 

GGCTGACGGC GCCGAGTCGT CCTGCGCAGC 15000 

CGGGGCTGAC GCCCGCCGAC GTCGACGCGG 15060 

GCGACCCCAT CGAGGCGGGT GCGTTGCTGG 15120 

GTCCGTTGTC GTTGGGGTCG TTGAAGTCGA 15180 

TGGGTCGCGT GATCAAGGTG GTGCAGGCGA 15240* 

ATGTGGATGC GCCGTCGTCG AAGGTGGAGT 153 00 

AGACCCGGTC GTGGCCGCGG CGGGTGGAGC 15360 

GGGTGAGCGG GACCAACGCC CATGTGGTCC 15420 

AGCACGGGGA CGGCCCTGAA CCCGAGCGGC 15480 

TGCTTTCTGC GCGGTCGGAG GGGGCGTTGC 15540 

TGGAGCGGGT GGGTGCGGAT CCGCGGGATG 15600 

CGTTCGGTGA GCGTGCGGTG GTGGTGGGCC 15660 

ATGTGGTGGC TGCCGGGGCT CCTGTGGGTG 15720 

CTGTGGTGCG GGGGAGTGCG GTGCGGGGTC 15780 

GTGCGCAGTG GGTTGGTATG GGGCGTGGGT 15840 
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TGTATGCGGG GGGTGGGGTG TTTGCGGAGG TGCTGGATGA GGTGTTGTCG GTGGTGGGOG 15900 

AGGTGGGGGG TTGGTCGTTG CGGGATGTGA TGTTCGGCGA CGTCGACCTC GACGCGGGTC 15960 

5 

CCGGGGCTGA TGCGGGTGTC GGTTCGGGTG TTGGTGTGGG TGGGTTGTTG GGTCGGACGG 16020 

AGTTTGCTCA GCCTCCGTTG TTTGCGTTGG AGGTGGCGTT GTTCCGGGCG TTGGAGGCTC 16080 

10 GGGGTGTCGA GGTGTCGGTG GTGTTGGGTC ATTCGGTGGG GGAGGTCGCT GCTGCGTATC 16140 

TGGCGGGGCT GTTGTCGTTG GGTGATGCGG TGCGGTTGGT GGTGGCGCGG GGTGGGTTGA 16200 

TGGGTGGGTT GCCGGTGGGT GGGGGGATCT GGTCGGTGGG GGCGTCGGAG TCGGTGGTGC 16260 

15 GGGGGCTTGT TGAGGGGTTG GGGGAGTGGG TGTCGGTTGC GGCGGTGAAT GGGCCGCGGT 16320 

CGGTGGTGTT GTCGGGTGAT GTCGGTGTGC TGGAGTCGGT GGTTGCCTCG CTGATCGGGG 16380 

ATGGGGTGGA GTGCCGGCGG TTGGATCTGT CGCATGGGTT TCATTCGGTG TTCATCGAGC 16440 

20 

CGGTGTTCGG GGAGTTCCGG GGGGTTGTGG AGTCGTTCGA GTTCGGTCGG GTGCGGCCGG 16500 

GTCTGGTGGT GGTGTCGAGT GTGTCGGGTG GGGTGGTGGG TTCGGGGGAG TTGGGGGATC 16560 

CGGGGTATTG GGTGCGTCAT GCGCGGGAGG CGGTGCGTTT CGCGGATGGG GTGGGGGTGG 16620 

25 

TGCGTGGTCT GGGTGTGGGG ACGTTGGTGG AGGTGGGTCC GCATGGGGTG CTGACGGGGA 16680 

TGGCGGGTGA GTGCCTGGGG GCCGGTGATC ATGTGGTGGT GGTGCCGGCG ATCCGGCGGG 16740 

30 GCCGTGCGGA GCGGGAGGTG TTCGAGGCGG CGCTCGCGAC GGTCTTCACC CGGGACGCCG 16800 

GCCTGGACGC CACGACACTC CACACCGGGA GCACCGGCCG ACGCATCGAC CTCCCCACCT 16860 

ACCCCTTCCA ACACGACCGC TACTGGCTGG CCGCCCCGTC CCGGCCCAGG ACGGACGGGC 16920 

35 TGTCGGCGGC GGGTCTGCGC GAGGTGGAGC ACCCCCTGCT CACCGCCGCC GTGGAACTCC 16980 

CCGGCACCGA CACCGAGGTG TGGACCGGCC GCATATCCGC TCCCGACCTG CCCTGGCTCG 17040 

CCGACCACCT GGTGTGGGAC CGAGGCGTGG TGCCGGGGAC CGCGCTGCTG GAGACGGTGC 17100 

40 

TCCAGGTGGG AAGCCGGATC GGTCTGCCGC GCGTCGCCGA ACTGGtCCTG GAGACGCCGC 17160 

TCACCTGCAC GTCGGACCGC CCGCTCCAGG TCCGGATCGT CGTGACCGCT GCCGCCACCG 17220 

^ CCCCCGGGGG CGCGCGTGAG CTGACCCTCC ACTCGCGGCC CGAGCCCGTG GCCGCCTCCT 17280 

CGTCCTCCCC GAGTCCCGCC TCTCCCCGGC ACCTCACGGC GCAGGAGAGC GACGACGACT 17340 

GGACCCGGCA TGCCTCAGGG CTGCTCGCCC CGGCTGCCGG CCTCGCCGAC GACTTCGCCG 17400 

SO AGCTCACCGG CGCCTGGCCC CCCGTCGGCG CCGAGCCCCT CGACCTCGCC GGTCAGTACC 17460 

CGCTCTTCGC AGCCGCCGGA GTGCGCTACG AAGGCGCCTT CCGAGGGCTC CGCGCGGCAT 17520 

GGCGTCGAGG CGACGAGGTC TTCGCCGACG TACGGCTGCC CGACGCGCAC GCGGTCGACG 17580 

55 

CTGATCGTTA CGGGGTGCAC CCCGCCCTGC TCGACGCGGT GCTCCACCCG ATCGCGTCGC 17640 
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TGGACCCGCT GGGCGACGGC GGGCACGGTC TGCTGCCGTT CTCCTGGACC GACGTACAGG 17700 

GACACGGCGC CGGCGGACAC GCCCTCCGGG TACGGGTGGC GGCCGTCGAC GGCCGCGCGG 17760 

5 

TGTCGGTCAC CGCGGCCGAC CACGCGGGCA ACCCGGTGTT ATCCGCCCGG TCCCTGGCAC 17820 

TGCGTCGTAT CACCGCGGAC CGGCTTCCCG CCGCGCCCGT CGCCCCTCTC TACCGCGTGG 17880 

,0 ACTGGCTGCC GTTCCCGGGT CCGGTGCCCG TATCCGCGGG CGGCCGCTCG GCGCTCCTCG 17940 

GACCCGAGGC CGAAGCCACG GCTGCCGGAC TGCGTGCGGT GGGCCTCGAC GTGCGTACCC 18000 

ATGCGCTCCC CCTCGGAGAG CCCCTGCCTC CGCAGGCCGG TACCGACGCG GAGGTGATCA 18060 

15 TCCTCGACCT GACCACCACC GCAGCCGGCC GTACGGCGTC GGACGGGGGG CGGCTCAGTC 18120 

TCCTCGACGA GGTGCGTGCG ACGGTGCGCC GGACCCTCGA AGCCGTACAG GCCCGCCTCG 18180 

CCGACACCGA AACGGCCCCC GACGTCGACG TCCGTACGGC CGCGCGCCCC CGCACAGCCG 18240 

20 

CCCGTACAAG CCCCCGCGTG GACACCCGCA CGGGAGCCCG CACCGCTGAC GGCCCCCGGC 18300 

TCGTCGTCCT GACCCGGGGC GCGGCCGGAC CCGAGGGAGG CGCGGCCGAT CCCGCGGGTG 18360 

CCGCTGTCTG GGGGCTCGTC CGGGTCGCCC AGGCCGAACA GCCCGGCCGC TTCACCCTGG 18420 

25 

TGGACGTCGA CGGCACCCAG GCGTCGCTGC GGGCCCTGCC CGGTCTCCTG GCCACGGATG 18480 

CCGGCCAGTC GGCCGTGCGC GACGGACGTG TCACCGTCCC GCGCCTCGTC CCGGTCGCCG 18540 

30 ACCCCGTCCC CCACGGCGGC GGCACGGCGG CCGACGGGAC GGGTGCCGGC GAGCCGTCCG 18600 

CGACCCTGGA CCCCGAAGGC ACCGTGCTGA TCACCGGCGG CACCGGAGCA CTGGCCGCGG 18660 

3 AAACCGCCCG GCACCTGGTC GACCGGCACA AGGTGCGCCA TCTCCTGCTG GTOGGCAGGC 18720 

35 GCGGTCCCGA CGCACCCGGC GTCGATCGAC TGGTCGCCGA GTTGACCGAG TCGGGTGCCG 18780 

AGGTCGCCGT ACGGGCCTGT GACGTCACGG ACCGCGACGC CCTCCGCCGC CTGCTCGACG 18840 

CACTCCCCGA CGAACACCCG CTGACCTGCG TGGTGCACAC CGCCGGGGTG CTCGACGACG 18900 

40 

GCGTGCTCTC CGCCCAGACG GCCGAGCGGA TCGACACGGT GCTCCGGCCC AAGGCCGACG 18960 

CCGCCGTCCA CCTGGACGAG CTGACCCGGG AGATCGGACG GGTGCCCCTG GTGCTGTACT 19020 

^ CCTCGGTCTC GGCCACCCTG GGCAGCGCGG GGCAGGCCGG GTACGCGGCG GCCAACGCCT 19080 

TCATGGACGC GCTGGCCGCC CGGCGGTGCG CCGCCGGGCA CCCCGCGCTG TCGCTCGGCT 19140 

GGGGCTGGTG GTCCGGGGTG GGTCTCGCCA CCGGACTGGA CGGAGCGGAC GCGGCGCGGG 19200 

50 TCAGGCGCTC GGGTCTCGCC CCGCTCGACG CCGGCGCCGC ACTGGACCTG CTCGACCGGG 19260 

CGCTGACCCG GCCCGAGCCG GCCCTGCTCC CCGTCCGGCT CXJACCTGCGC GCCGCGGCCG 19320 

GTGCCACCGC TCTCCCGGAG GTCCTCCGTG ACCTGGCCGG CGTACCGGCG GACGCCCGCA 19380 

55 

GCACGCCCGG GGCCGCGGCG GGCACCGGGG ACGAGGACGG TGCCGTGCGC CCTGCCCCCG 19440 
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20 
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30 



35 



40 



45 



SO 



55 



CCCCGGCCGA CGCCGCCGGG ACGCTGGCCG 
GTACGGCTCT CCTGCTCGAC CTGGTGCGGA 
ACCCCGCCGC GATCGGCGCC GCCCGCACCT 
CTCTCGACCT CCGCAACCGG CTGAACACAC 
TCTTCGACCA CCCCACACCG CTCGCCCTCG 
CCGCTCCAGC GGAACCGGCC GCTGAGGTCC 
CCGOCGTGAT CGACCGGCTG GAACGCAGCC 
TCCGCGCGGC ACGGCGGCTG CGCGGCCTCC 
CGTCCGGTCC GGATGCCGGA GAGCACGCCC 
TCAGGTCGGC CTCCGACGAC GACTTGTTCG 
GACCGCGCCG CGCGCCGACC GCTGAACCGC 
AGGAGAACCG TGTCTGCGAC CAACGAGGAG 
GCCGACCTGC ACAGCGCACG AGAGCGGTTG 
ATCGCGATCG TCGGCATGGC GTCCCGTTAC 
TGGGACCTOG TGGCCGCCGG TACGGACGCG 
GACGCCGAGG GTCTGTACGA CCCGGAGCCG 
GGCGGGTTCC TGCACTCGGC GGCCGAGTTC 
GAGGCGGCGG CGATGGATCC GCkGCAGCGG 
GAGCGGGCCG GGATCGTCCC CGCGTCGCTC 
GTCATCTACC ACGACTACGG CAGCCACCAG 
CTCGGCCTCG GCACCGCGGG GAGCGTCGCC 
CAGGGGCCGG CCGTCACCAT GGACACGGCA 
GCGGTGCAGT CGTTGCGGCG GGGCGAGTCC 
TTGGCGACGC CCACGGTGTT CGTGGAGTTC 
CGGTCCAAGG CGTTCGCGGA GGGCGCCGAC 
CTGCTGGTGG AGCGGCTCTC CGACGCCCGC 
CGGGGCAGCG CGGTCAACCA GGACGGTGCC 
GCCCAGCAGC GGGTGATCCG TGACGCGCTG 
GACGCGGTCG AGGCGCACGG CACCGGCACA 
CTGATCGCCA CCTACGGCAG TGAACGGGTG 



CGCGGCTCGC GGGACGTTCC 
CCGAGGTCGC GGCGGTGCTC 
TCAAGGACGC CGGATTCGAC 
GCACOGGACT GCGGCTGCCC 
CCGAACTCCT GCTCGACGGG 
CGGAOGAAGC GGCCGGTOCC 
TCGCCGCGAC CGACGACGGC 
TCGACGCGCT CCCCGCCGGT 
CCGGTCGCGG CGACGTGGTG 
ACCTGCTCGA CAGCGACTTC 
TCTTCACCCA GACCCACGAG 
AAGTTGCGGG AGTACCTGCG 
CGCGAGGTCG AGTCGGCGAG 
CCGGGCGGTG TGGCGTCGCC 
ATCTCCCCGT TCCCCGTCGA 
GGGGTCCCGG GCAAGAGCTA 
GACGCGGAGT TCTTCGGGAT 
TTGCTGCTGG AGACGTCGTC 
CGCGGCACCC GTACCGGCGT 
GTCGGCACCG CCGCCGATCC 
TCGGGCCGGG TGGCGTACAC 
TGCTCGTCCT CGCTGGTGGC 
GATCTCGCGT TGGCCGGCGC 
TCGCGGCAAC GGGGGCTGGC 
GGCACGGCGT GGGCCGAGGG 
CGCAACGGCC ATCGGGTGCT 
AGCAACGGCC TCACCGCACC 
GCCGACGCGG GGCTGACGCC 
CCGCTCGGCG ACCCGATCGA 
GGCGACCCGC TGTGGCTCGG 



gcacccgagc 
ggacacggcg 
tccctcaccg 
gcgaccctcg 
ctggaggcgg 
gagaccctgt 
gacgcccggg 
cccggtgccg 
atcgaccggc 
cagtgagccc 
accacgcctg 
gcgcgcgatg 
ccgtgagccg 
ggaggagctg 
ccgcggctgg 
cgtgcgcgag 
ctcgccgcot 
ggaggcgctg 
cttcaccggc 
cagtggacag 
cctcggtcta 

GTTGCACCTG 
GGCGACGGTC 
GGCGGACGGA 
CGCCGGTGTG 
CGCCGTGCTC 
CAGCGGGCCC 
CGCCGACGTG 
GGCCGGCGCG 
TTCGCTGAAG 



19S00 

19560 

19620 

19680 

19740 

19800 

19860 

19920 

19980 

20040 

20100 

20160 

20220 

20280 

20340 

20400 

20460 

20520 

20580 

20640 

20700 

20760 

20820 

20880 

20940 

21000 

21060 

21120 

21180 

21240 
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TCGAACATCG GACACACCCA GGCCGCCGCC 
GCGTTACGGC AGTCCGAGCT GCCGCGCACC 

5 

GAATGGGACG CGGGCGCCGT GC AACTGCTC 
CACAGGCCCC GGCGGGCCGC GGTCTCCGCC 

1Q ATCATCGAGG AACCGCCCGC GGCCGGTGAC 

GGCGAGGCGA CCGCGTCCCC CTCCACCGCG 
GGGCCGCTGT CCCCCTCCTC CCCGGCCGTG 

15 GCCCTGCGCG CCCAGGCCGC CCGCCTGCGG 

CCGACCGACA TCGGCCACGC CCTGGCCGCC 
CTGCTCGGCG ACGACGGAGC CCCGGTCGAC 

20 

ACCCCCGACG CCGTCCACGG CACCGCGGCG 
GGCCAGGGTT CCCAGTGGGC CGGGATCGGC 
GCCGCCGAAC TGGACCGCTG CCAGGGCGCG 

25 

GACGTGCTGC GCGGCGCGCC CGCGGCGCCC 
GCCACCTTCG CCGTCATGGT GGGACTCGCC 

30 GCCGCCGTCA TCGGCCACTC CCAGGGCGAG 

TCCCTGGAGG ACGCCGCCCG GATCGTGGCC 
GCCGGGCGGG GCGGCATGGC CTCGGTGGCC 

35 GCCGGCGGCG TCGAGATCGC CGCCGTCAAC 

CCCGGCGCCC TGGAGGCGTT GCTCGTCACG 
ATCGACGTCG ACTACGCGTC CCACTCCCAC 

40 

ACCGTCCTCG GCCCCGTCCG GCCGCGGAGG 
GCGGCGCTCC TCGACACCGC CACCCTGGAC 
CCGGTGCGCT TCGAGCCGAC CGTACGCGCC 

45 

GAGTGCTCCG CGCATCCCGT CCTGACCGTC 
GGCGCGGTCC CGGCCCTCGC TTCGCTGCGC 

SO ACCTCCGCCG CCGAGGCCCA GGTCGTCGGC 

GGCGCCGGCC GGGTGGACCT GCCGACCTAC 
CCCGCCCGGC CCGACTCCGC GGCGACGGCC 

55 GGAGACCGGC TCGGCTACCA CGTCGCGTGG 



GGAGCCGCCG GCGTCATCAA GATQGTGCAG 21300 

CTGCACGTCG ACGCGCCCTC GGCCAAGGTC 21360 

ACCGGCGTCC GGCCATGGCC CCGGCGCGAG 21420 

TTCGGCGTCA GCGGCACCAA CGCCCACGTC 21480 

ACCTCGCCCG CCGGCGACAC CCCTOAGCCG 21540 

GCCGGGCCGTT CGTCCCCCTC CGCQGTGGCC 21600 

GTCTGGCCCC TGTCCGCCGA GACCGCCCCC 21660 

GCGCACCTCG AACGCCTCCC CGGCACCTCG 21720 

GAACGCGCCG CCCTCACCCG ACGCGTCGTG 21780 

GCACTCGCCG CCCTCGCCGC CGGCGAGACC 21840 

GACATCCGCC GGGTCGCCTT CGTGTTCCCC 21900 

GCCGAACTGC TGGACACGGC CCCGGCCTTC 21960 

CTCTCCCCGT ACGTGGACTG GAACCTCGCG 22020 

GGCCTCGACC GGGTCGACGT CGTCCAGCCG 22080 

GCGCTGTGGC GCTCCCTCGG GGTCGAACCC 22140 

ATCGCCGCGG CCTGCGTGGC GGGCGCGCTC 22200 

CTGCGCTCCC AGGTCATCGC CCGCGAACTG 22260 

CTGCCCGCGG CGGAGGTCGA GGCCCGCCTG 22320 

GGCCCCGGCT CGACCGTCGT CTGCGGAGAG 22380 

CTGGAGAGCG AAGGCACCCG GGTCCGCCGC 22440 

TACGTCGAGA GCATCCGGGC GGAACTCGCC 22500 

GGCGACGTGC CCTTCTACTC CACCGTCGAG 22560 

GCCGACTACT GGTACCGCAA CCTGCGCCTC 22620 

ATGCTCGACG ACGGCGTCGA CGCGTTCGTG 22680 

GGCGTGCGCC AGACCGTGGA GAGCGCCGGC 22740 

CGCGACGAGG GCGGGCTGCG GCGCTTCCTC 22800 

GTCCCCGTGG ACTGGGCGAC GCTCCGCCCA 22860 

GCCTTCCAGC GCGAACGCCA CTGGGTCGGC 22920 

GCCACGACCG GTCACGACGC CCCGGAGCCC 22980 

AAGGGACTGC GCTCCACCAC CGGCGGCTGG 23040 
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CGCCCCGGCC TGCGCCTGCT GATCGTGCCC ACCGGGGACC AGTACACCGC CCTCGCCGAC 23100 

ACCCTGGAAC AGGCCGTCGC CTCCTTCGGC GGAACGGTCC GCCGCGTCGC CTTCGACCCG 23160 

5 

GCACGCACCG GACGCGCCGA GCTCTTCGGC CTCCTCGAGA CGGAGATCAA CGGCGACACC 23220 

GCCGTCACCG GCGTCGTCTC GCTGCTCGGA CTGTGCACCG ACGGCAGGCC GGACCACCCC 23280 

10 GCCGTGCCCG TCGCCGTCAC CGCCACCCTC GCCCTCGTCC AGGCCCTGGC CGACCTCGGC 23340 

AGCACCGCAC CGCTGTGGAC CGTCACCTGC GGCGCGGTCG CCACCGCCCC CGACGAACTG 23400 

CCGTGCACCG CCGGTGCCCA GCTGTGGGGC CTGGGCCGGG TGGCCGCGCT GGAGCTGCCC 23460 

75 GAGGTGTGGG GCGGCCTCAT CGACCTTCCC GCGCGGCCCG ACGCCCGGGT CCTCGACCGT 23520 

CTCGCCGGCG TCCTCGCCGA ACCCGGCGGC GAGGACCAGA TCGCCGTACG GATGGCGGGC 23580 

GTCTTCGGCC GCCGGGTCCT GCOGAACCCG GCCGACTCCC GGCCCCCGGC CTGGCGCGCC 23640 

20 

CGGGGCACCG TCCTCATCGC CGGCGACCTC ACGACGGTGC CCGGCCGACT GGTCCGGTCC 23700 

CTCCTCGAGG ACGGCGCGGA CCGCGTGGTG CTGGCCGGAC CCGACGCCCC CGCACAGGCC 23760 

GCCGCCGCCG GACTGACCGG CGTCTCCCTC GTCCCCGTGC GCTGCGACGT CACCGACCGC 23820 

25 

GCCGCACTGG CCGCGCTGCT CGACGAGCAC GCGCCCACCG TCGCCGTGCA CGCCCCGCCC 23880 

CTGGTGCCCC TGGCGCCGCT GCGGGAGACG GCACCCGGCG ACATCGCCGC CGCCCTCGCC 23940 

30 GCCAAGACCA CGGCCGCCGG CCACCTGGTC GACCTGGCGC CGGCCGCGGG CCTCGACGCG 24000 

CTGGTGCTGT TCTCCTCGGT CTCCGGAGTG TGGGGCGGCG CGGCCCAGGG CGGCTACGCG 24060 

GCCGCCAGCG CGCACCTCGA CGCGCTGGCC GAACGCGCCC GCGCCGCGGG GGTGCCCGCG 24120 

35 TTCTCCGTGG CCTGGAGCCC CTGGGCCGGA GGCACGCCCG CCGACGGTGC CGAGGCGGAG 24180 

TTCCTCAGCC GGCGCGGGCT GGCTCCCCTC GACCCCGACC AGGCGGTCCG GACCCTGCGC 24240 

CGCATGCTGG AGCGCGGCAG CGCCTGCGGT GCGGTCGCCG ACGTCGAGTG GAGCCGGTTC 24300 

40 

GCCGCCTCCT ACACCTGGGT GCGTCCCGCC GTACTCTTCG ACGACATCCC GGACGTGCAG 24360 

CGGCTGCGCG CGGCCGAACT CGCCCCGAGC ACCGGAGACT CGACCACCTC CGAACTCGTC 24420 

CGCGAGCTGA CCGCGCAGTC CGGCCACAAG CGGCACGCCA CCCTGCTGCG GCTGGTGCGC 24480 

45 

GCACACGCCG CCGCCGTCCT CGGACAGTCC TCCGGCGACG CGGTGAGCAG CGCCCGCGCC 24540 

TTCCGCGACC TCGGCTTCGA CTCGCTGACC GCCCTCGAAC TGCGCGACCG GCTCAGCACC 24600 

50 AGCACCGGGC TCAAACTGCC CACCTCCCTG GTCTTCGACC ACTCCAGCCC GGCCGCGCTC 24660 

GCCCGGCACC TCGGTGAGGA ACTCCTCGGC CGGAACGACA CCGCCGACCG GGCCGGCCCC 24720 

GACACCCCGG TACGGACGGA CGAGCCCATC GCCATCATCG GCATGGCCTG CCGGCTGCCC 24780 

55 

GGCGGGGTGC AGTCCCCCGA GGACCTGTGG GACCTGCTGA CCGGTGGGAC CGACGCCATC 24840 
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ACCCCCTTCC CGACCAACCG GGGATGGGAC 
TCGCCCGGGC ACCACACCTA CCTCCCCGAG 

5 

GACCCCGGCT TCTTCGGCAT CAGCCCCCGC 
CTGATCCTGG AGACGTCCTG GGAGTCCTTC 

10 CGCGGCAGCC GCACCGGGGT CTTCGTCGGC 

CAGGACGGCG ACGAGAACTT CGACGGCTAC 
TCCGGCCGGC TCTCCTACGT CTTCGGACTG 

15 TGCTCGGCCT CCCTGGCCGC ACTGCACCTG 

GACTACGCCC TCGCCGGCGG GGCCACGGTG 
GCCCGTCAGC GAGCGGTGTC GCCGGACGGC 

20 

GGGGTCGGTC TCGCCGAGGG AGCCGGGATG 
AAGAAGGGCC ATCCGGTACT GGCGGTGGTG 
AGCAACGGCC TCACCGCACC CAGCGGGCCC 

25 

GCCGACGCGG GGCTGACGCC CGCCGACGTG 
CCGCTCGGCG ACCCCATCGA GGCCGGCGCG 

30 GACGGCCCGC TGTGGCTGGG TTCGCTGAAG 

GGCGTGGCCG GGGTGATCAA GATGGTGCTG 
CTGCACGCGT CGACGGCGTC GTCCAGGATC 

55 GACGAGGCCA GGCCCTGGCT CCAGCGGGCC 

TTCGGCATCA GCGGCACCAA CGCGCACCTC 
CCCGAACTGC TCGCGCCCGA ACCGGCCGCC 

40 

CACGAGGTGA CCGTGCCCCT GATGATGTCC 
GCGCGGCGCC TGCGCGCCGA CCTGCTCGCC 
TACACCCTCA TCACCACCCG CACCCGGTTC 

45 

TTCACGGAGC TGATCGCGGC CCTCGACGAC 
CTGCGGGGCA CCGCCGGCAC CTCCGACCAG 
50 CAGTGGCCCG AGATGGCCGA CGGGCTGCTG 

GAGACCGCCC GCGCCTGCGA CCTCGCGCTC 
GTACTGCGCC GGGAACCCGG CGCGCCCTCG 

55 

CTGTTCACCA TGATGGTCTC GCTCGCCGAG 



AACGAGACCC TCTACGACCC CGACCCCGAC 24900 

GGCGGGTTCC TGCACGACGC GGCCGAGTTC 24960 

GAGGCCCTGG CCATGGACCC GCAGCAGCGG 25020 

GAACGGGCCG GCATCGACCC GGTCGAACTG 25080 

ACCAACGGAC AGCACTACGT GCCGCTCCTC 25140 

ATCGCCACCG GCAACTCCGC CAGCGTGATG 25200 

GAGGGCCCCG CCGTCACCGT CGACACCGCC 25260 

GCGGTGCAGT CACTGCGCCG CGGCGAATGC 25320 

ATGTCCACCC CCGAGATGCT GGTGGAGTTC 25380 

CGCAGCAAGG CGTTCGCGGA GGCGGCCGAC 25440 

CTGCTCGTGG AGCGGCTGTC GGAGGCGCAG 25500 

CGGGGCAGTG CCGTCAACCA GGACGGTGCC 25560 

GCCCAGCAGC GGGTGATACG GGAGGCGCTG 25620 

GACGCGGTCG AGGCGCACGG CACCGGCACG 25680 

CTGCTCGCCA CGTACGGCCG GGACCGGCGC 25740 

TCGAACATCG GGCACACCCA GGCCGCGGCC 25800 

GCGCTGCGCC ACGGCGAGCT GCCGCGCACC 25860 

GATTGGGACG CGGGCGCCGT GGAGTTGCTG 25920 

GAGGGGCCGC GCCGGGCGGG CATCTCCTCG 25980 

GTCATCGAGG AGCCGCCGGA GCCCACCGCG 26040 

GACGGCGACG TCTGGTCCGA GGAGTGGTGG 26100 

GCGCACAACG AAGCCGCCCT GCGCGACCAG 26160 

CACCCCGAGC TGCACCCGGC CGACGTCGGC 26220 

GAGCAGCGGG CCGCCGTCGT CGGCGAGAAC 26280 

CTCGTCGAAG GCCGACCGCA CCCGCTCGTG 26340 

GTCGTGTTCG TCTTCCCCGG CCAGGGCTCG 26400 

GCCCGCTCCA GCGGCTCCGG CTCCTTCCTG 26460 

CGGCCCCACC TCGGCTGGTC CGTCCTGGAC 26520- 

CTCGACCGGG TCGACGTGGT GCAGCCCGTG 26580 

ACGTGGCGTT CGCTGGGCGT CGAACCGGCC 26640 
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GCGGTCGTCG GTCACTCCCA GGGCGAGATC 
CTCGACGACG CGGCGCGCAT CGTCGCCCTG 

5 

AAGGGCGGCA TCGTCGCCGT GACCCTGTCC 
TGGAGCGACC GGCTCGCCGT CGCCGCCGTC 

10 GACCCGGACG CCCTGGCGGA GCTGGTCGCC 

CCCATCCCCG GCGTCGACAC CGCCGGGCAC 
CTCCGGAAGG TGCTCGCGCC CGTCGCGCCC 

15 GTCACCGGAG GACTGATCGA CACCGCCGAG 

CGCGAGCCGG TGGAGTTCGA GCAGGCCACC 
TTCCTGGAGT CGAGCCCGCA CCCCATGCTG 

20 

GCCGGTTCCC CGGCGGCCGT CCTCGGCACC 
CTGGGCGTCG CCCTCTGCCG CGCCTACACC 
TTCGGCCCCG ACTCACGCCA GGTGGAACTG 

25 

TGGTACAGCC CCGGCCACCG CGGTGACGAC 
CACCCGCTGC TGGGCAGCGG CGTCGAACTG 

30 CGGCTGGGCG CCGACACCAC CCCGTGGCTG 

CTGCCCGGCG CCGCCTTCGC CGACCTGGCG 
CCCCTCGAGC AGCTCACCCT GGCCGCGCCC 

35 CTGCGGCTGA ACGTCGGCGC CCCGGGCACC 

CGCGCCGAGG GCGCCACGGA CTGGACCCTG 
ACGGCCGACG CGCCGGACGC CTCGGCGGCC 

40 

ATCGGCGACT TCTACCAGCG CTTCTCCGAA 
GGACTGGTGA GCGCCCACCG CTGCGGCCCC 
CAGGCGCAGG GCGACGCGGC CCGCTTCGGC 

45 

CAGACCATGA GCCTCGGGGG CTTCTTCCCC 
CTGCGCGGCG TTCGGCTGTA CCGCGCCGGA 
SO GTCTCCGAGG ACGCGGTCCG CATCAGGTGC 

ATCGAGTCCT TCATCATGCG GCCGGTCGAC 
GGCGCCGACG CGCTCTTCCG CATCGCCTGG 

55 

ACCGGCGACG GCACCCCTCC CCCGGTGCGC 



GCCGCCGCCT ACGTCGCCGG CGCCCTGACG 26700 

CGCAGCCAGG CGTGGCTGCG GCTGGCCGGC 26760 

GAACGCGACC TGCGTCCCCG CCTGGAGCCC 26820 

AACGGCCCCG AGACCTGCGC CGTCTCCGGG 26880 

GAACTCGGTG CGGAGGGCGT GCACGCCCGC 26940 

TCGCCGCAGG TCGACACGCT GGAGGCCCAC 27000 

CGCACCTCCG ACATCCCGTT CTACTCGACG 27060 

CTGGACGCCG ACTACTGGTA CCGCAACATG 27120 

CGCGCCCTGA TCGCCGACGG CCACGACGTG 27180 

GCCGTCTCCC TCCAGGAGAC GATCAGCGAC 27240 

CTGCGGCGCG GCCAGGGCGG CCCCCGCTGG 27300 

CACGGCCTGG AGATCGACGC CGAGGCCATC 27360 

CCCACGTACC CCTTCCAGCG CGAGCGCTAC 27420 

CCCGCCTCCC TCGGTCTGGA CGCCGTCGAC 27480 

CCGGAGTCCG GTGACCGGAT GTACACCGCA 27540 

GCCGACCACG CGCTGCTGGG GTCGCCGCTG 27600 

CTCTGGGCCG GCCGCCAGGC CGGCACCGGC 27660 

CTGGTGCTGC CCGGCTCCGG GGGTGTCCGG 27720 

GACGACGCCC GCCGCTTCGC CGTGCACGCC 27780 

CACGCCGAGG GGCTGCTCAC CGCGCAGGAC 27840 

ACCCCGCCCC CCGGCGCCGA ACAACTGGAC 27900 

CTCGGTTACG GCTACGGCCC GTTCTTCCGG 27960 

GACATCCACG CGGAGGTCGC GCTGCCCGTC 28020 

ATCCATCCCG CGCTGCTGGA CGCGGCGCTG 28080 

GAGGACGGCC GCGTCCGCAT GCCGTTCGCC 28140 

GCCGACCGGC TGCACGTGCG CGTCTCGCCC 28200 

GCCGACGGCG AGGGACGGCC GGTCGCCGAG 28260 

CCGGGACAGC TCCTGGGCGG CCGCCCGGTC 28320 

CGGGAACTCG CCGCCGGCCC GGGCACCCGT 28380 

TGGGTGCTGG CGGGACCCGA CGCGCTGGGC 28440 
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CTCGCCOAGG 
TCCCCGACGG 

5 

GACGTCGCCG 
CTCGCGGCCC 

10 GCCGTACGCG 
CCCTCCGCGC 
GCGTCCGCCC 

15 GCCGGGACGG 
ACCCCGCCCG 
CTCGCCCTGG 

20 

GCCGTACGCG 
GGCGAGGCGG 
ACCGGTGTCG 

25 

TGCGTGGCCG 
GCCTCGGTCC 

30 AGGCCGGGGG 
CAGATCGCCC 
GCCCTGCGCG 

55 GCGACCGCCT 
ACCAAGGAGT 
CTGGGCAAGA 

40 

TACCGGGCGT 
CTCATGGACC 
CGCCGGGCCG 

45 

CTGACCATGC 
GCCCTCGGCA 
50 ATCGCCGCCC 
GCCCTGGGCG 
GGACTGCTCG 
GTCCTCGACG 



CGGCCGACGC CCACCTGCCC 
GACGCCCGGC GCCGGACCCC 
CCGACGCGCA CACCGTGGCC 
CGGAGGGCCC GGACGGCGCC 
ACGACGCCGA GGTGGACGAC 
AGGCCGAGGA GCCCCGCCGG 
GGGCGCTGAC CGACGCCCTC 
TGTACGTGCC CCGGCTGGAG 
ACGACGGTGC CTGGCGGCTG 
TGCCCGCCCC GGACGCCGAG 
CCGCGGGCGT CAACTTCCGC 
AGATGGGAAC GGAGGGCGCC 
CCGTCGGCGA CCGCGTGCTC 
ACCACCGGCT GCTCGCCCCC 
CCGCGGTGTT CCTCAGCGCC 
AGCGGGTGCT CGTGCACGCC 
GCCACCTCGG CGCGGAGGTG 
CCATGGGCAT CACCGACGAC 
TCACCGGAGC GGACGGCACG 
TCGTGGACGC CTCCCTCGGG 
CCGACGTCCG GGACCCCGAG 
TCGACCTCAA CGAGGCCGGA 
TGTTCGCCGC CGGCGTGCTG 
CGGACGCCCT GCGCACCATC 
CGCCCGCCTG GCACCCGTAC 
GCCGCATCGC CCGCCACCTG 
GCCGGGGCCC GGACGGCGAG 
CGTCGGCCAC CGTGGTCGCC 
CCGGCATACC GGCCGATCAC 
ACGGCGTGCT GCCCGGGCTC 



GCCGTTCCCG GCCCGGACGG 
GTCGTGTTCG CGGTCCGTCC 
TGCCGGGTGC TGGACCTCGT 
CGCCTGGTGG TGGCCACCCG 
CCGGCCGCGG CCGCCGCGTG 
TTCCTGCTCG TGGACCTGGA 
GCCTCCGGCG AACCGCAGAC 
CGGGCCGCCG ACCGCACGGA 
GGCCGGGGCA CCGACCTCAC 
GCGCCGCTGG AGCCCGGCCA 
GACGCCCTCA TCGCCCTCGG 
GGCACCGTCG TCGAGGTCGG 
GGCCTGTGGG ACGGCGGCCT 
GTCCCGGACG GCTGGTCCTA 
TACTACGGTC TGGTCACCCT 
GCCGCCGGGG GCGTCGGCAT 
CTGGCCACCG CGAGCCCCGG 
CACCTCGCCT CCTCCCGCAC 
TCCCGCGCGG ACGTCGTCCT 
CTGCTCCGTC CGGGCGGCCG 
CGGATCGCCG CCGAACACCC 
CCCGACGCAC TCGGCCGGCT 
CACCCGCTGC CCGTCGTCAC 
AGCCAGGCCC GGCACACCGG 
GGCACGGTCC TGGTCACCGG 
GCGAGCCGGC ACGGCGTCCG 
GGCGCCGCGG AGCTGGTCGC 
TGCGACGTCT CCGACCCGGA 
CCGCTGACGG CGGTCGTGCA 
ACCCCCGAGC GGATGCGGCG 



OGCACTGCCG 28500 

CGGGACCGCC 28560 

CCAGCGCCGG 28620 

CGGCGCGGTC 28680 

GGGCCTGCTG 28740 

CGACGACCCG 28800 

CGCGGTCCGG 28860 

CGGGCCGCTC 28920 

CCTCGACGGC 28980 

GGTGCGCGTC 29040 

CATGTACCCG 29100 

CCCCGGCGTC 29160 

GGGCCCGCTG 29220 

CGCCCAGGCC 29280 

GGCCGGCCTC 29340 

GGCCGCGGTG 29400 

CAAGTGGGAC 29460 

CCTCGACTTC 29520 

GAACTCGCTC 29580 

GTTCCTGGAG 29640 

CGGGGTGCGC 29700 

GCTGCGGGAA 29760 

CCACGACGTG 29820 

AAAGCTCGTC 29880 

TGGCACCGGC 29940 

CCGGCTGCTG 30000 

CGACCTCGCC 30060 

CGCCGTCCGC 30120 

CAGCACCGGC 30180 

CGTGCTGCGG 30240 
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CCCAAOGTGG AGGCCGCCGT CCACCTGGAC 
TTCCTCCTCT TCTCCTCCAG CGCCGGTCTG 

5 

GCGGCCAACG CCACCCTCGA CGCCCTCGCC 
GTGTCACTCG CCTGGGGTCT GTGGTCCGAC 

w GAGAGCCTCC AGCGGCGCTT CGCCCGCAGC 

GCCGCGCTGT TCGACGCCGC CCTGCGGGTC 
GACCCGGCCG CGCTGCGCGC CACCGGAAGC 

15 TCCGCCCCGG CGACCGGGTC CGCGGCCCCG 

GGGACCGTCG GCGAGCCGCT CGCCGAGCGG 
GACCGGCTGC TCGGCCTGGT CGGCGAACAC 

20 

GCCGAGGTCC GGCCCGACCG GCCGTTCCGC 
GAACTGCGCA ACCGGATGGC GGCGGTCACC 
GACCACCCCA CCCCCGCCGC GCTGTCCTCG 

25 

CCGGTCACCA CCACACCGCT GCTGTCCGAA 
CTCACCCCCG AGCACCTCGC GGAGCTCGCC 

30 CTGCGCCTGG ACGCCCTGGC CGACCGCTGG 

GACGACGACA TCACCGACGT GCTGAGCAGC 
GACGAGCGGT ACGGCACGTC GTGACCGCCG 

35 GCACCACCAT GGCGAACGAA GAGAAGCTGC 

TGCACCGGGC CACCGAGCAG CTGCGTGCCC 
TCGTCGGGGC GGCCTGCCGA CTCCCCGGCG 

40 

TGCTGCACGC CGGTGCCGAC GCGGTCGGCC 
AGGGAAGGTA CTCGCCCGAC CCCGACACGC 
TCGTGCAGGG GGCCGACCGG TTCGACCCCG 

45 

TCACCATGGA CCCCCAGCAG CGGCTGCTGC 
CCGGTCTGGA CCCCCAGTCC CTGGCGGGCA 
50 AGAGCGGCTA CCAGAAGGGC GTCGAAGGGC 

CCGGCATCGT CAGCTTCACC GCCGGCCGCG 
CGCTGACGAT CGACACGGCC TGCTCCTCGT 

55 

CACTGCGCCG GGGCGAGTGC GACCTCGCAC 



GAACTCACCC GCGACCTCGA CCTGTCGGCG 30300 

CTCGGCAGCC CGGCCCAGGG CAACTACGCG 30360 

GCCCGGCGCC GGTCCCTCGG CCTCCCGTCG 30420 

ACCAGCCGGA TGGCACACGC ACTCGACCAG 30480 

GGCTTCCCGC CCCTGTCCGC CACGCTGGGC 30540 

GACGAGGCCG TGCAGGTCCC CATGCGGTTC 30600 

GTCCCCGCCC TGCTGTCGGA CCTCGTCGGG 30660 

GCGTCCGGCC CCCTTCCGGC TCCGGACGCC 30720 

TTGGCCGGAC TCTCCGCCGA GGAACGCCAC 30780 

GTGGCCGCGG TACTGGGCCA CGGCTCCGCC 30840 

GAGGTCGGGT TCGACTCGCT CACGGCCGTG 30900 

GGGGTCAGGC TCCCCGCCAC CCTGGTCTTC 30960 

CACCTCGACG GCCTGCTGGC CCCGGCACAG 31020 

CTGGACCGCA TCGAGGAGGC CCTGGCCGCC 31080 

CCCGCCCCCG ACGACCGGGC CGAGGTCGCC 31140 

CGCGCCCTCC ACGACGGCGC GCCCGGCGCC 31200 

GCCGACGACG ACGAGATCTT CGCGTTCATC 31260 

GCCCGGAGCC CCGCCCGTCA TCGAAAGGAA 31320 

GCGCCTACCT CAAGCGCGTC ACGGGTGAGC 31380 

TGGACCGGCG GGCCCACGAG CCGATCGCGA 31440 

GCGTCGAGAG TCCGGACGAC CTGTGGGAGC 31500 

CGGCCCCCGC CGACCGCGGC TGGGACGTGG 31560 

CCGGCACCTC GTACTGCCGC GAGGGCGGCT 31620 

CCCTCTTCGG CATCTCGCCC AACGAGGCGC 31680 

TGGAGACCTC CTGGGAGGCG CTGGAGCGAG 31740 

GCCGGACCGG CGTGTTCGCC GGGGCGTGGG 31800 

TCGAAGCCGA TCTGGAGGCC CAACTCCTGG 31860 

TCGCCTACGC CCTGGGCCTG GAQCGCCCGG 31920 

CGCTGGTGGC ACTCCACCTG GCGGTGCAGT 31980 

TGGCGGGCGG CGCCACGGTC ATCGCCGACT 32040 
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TCGCGCTCTT CACCCAGTTC TCCCGGCAGC 
CCTTOGGTGA GACGGCCGAC GGCTTCGGCC 

5 

AGCGGCTGTC GGACGCCCGC CGCAACGGGC 
CCGTCAACCA GGACGGTGCG AGCAATGGGC 

10 GGGTGATCCG TGAGGCGCTG GCCGACGCGG 

AGGCGCACGG CACCGGCACG CCGCTCGGCG 
CGTACGGGCA CGAACGGACG GGCGACCCGC 

15 GGCACACCCA GGCCGCCGCC GGCGTGGCCG 

ACGGTGAGCT GCCGCGCACC CTGCACGCGT 
CGGGCGCCGT GGAGTTGCTG GACGAGGCCA 

20 

GCCGGGCGGG CATCTCCTCG TTCGGCATCA 
AGGAGCCGCC CGCCCGGCCG GAGCCCGAGG 
CCGTCCTCCC GCTGTCGGCC GCCGGCGCGC 

25 

CCGCGCACCT GGCCGGCCAC GAGGAGATCA 
CCACCCGTGC CGCGCTCTCG CACCGGGCCT 

30 TCGACAGGCT GACCGCGCTG GCGGAGGACA 

CGGGCAGCGG CCGGCCCCCC GTCTTCGTCT 
TGGGCGCCGA ACTCCTGGAC AGGGCACCGG 

35 GGGCCCTCGC GGCCCACCTC GACTGGTCGG 

CCCCGCCGAT CGACCGCGCG GACGTCGTCC 
TCGCGGCGCT GTCGGAGTCC CACGGTGTAC 

40 

GCGAGATCGC CGCCGCCCAC GCGGCCGGTG 
TCGCCGAGCG CAGCAGGCTC TGGAAGCGGC 
TGGCCCCGGC CGACCGGGTC CGCGAACTGA 

45 

CCGCCGTCAA CGGCCCCGCC TCGGTCACCG 
TCGGCGGCCG GCTCTCCGCC GCCGGGGTGC 
50 CCGGACACTC ACCCCAGGTG GAGCAGTTCC 

TCCGCCCGAC CGCCGCCCGG CTGCCCTTCT 
CCGAAGGCCT GGACGCCGCG TACTGGTACC 

55 

CCACCCTGCG GACGCTGCTG CGCGAGGGCC 



GCGGGCTCGC CCCCGACGGG CGGTGCAAGG 32100 

CCGCCGAGGG CGCGGGGATG CTGCTGGTCG 32160 

ACCCGGTGCT GGCGGTGGTG CGGGGCAGTG 32220 

TGACGGCGCC GAGTGGTCCT GCGCAGCAGC 32280 

GGCTGACGCC CGCCGACGTG GACGCGGTCG 32340 

ACCCCATCGA GGCCGGCGCG CTCATGGCGA 32400 

TGTGGCTGGG TTCGCTGAAG TCGAACATCG 32460 

GGGTGATCAA GATGGTGCTG GCGCTGCGCC 32520 

CGACGGCGTC CTCCAGGATC GAATGGGACG 32580 

GGCCCTGGCC CCGGC GTGCC GAGGGGCCGC 32640 

GCGGCACCAA CGCGCACCTC GTCATCGAGG 32700 

AGGCCGCGCA GCCGCCCGCC CCGGCCACCA 32760 

GATCCCTGCG CGAGCAGGCC CGCAGGCTCG 32820 

CCGCCGCCGA CGCCCCCCGC TCCGCCGCCA 32880 

CGGTCCTGGC CGACGACCGG CGGGCGCTGA 32940 

GGAAGGACCC CGGCGTCACC GTCGGCGAGG 33000 

TCCCGGGACA GGGCTCCCAG TGGACGGGCA 33060 

TCTTCCGCGC CAAGGCCGAG GAGTGCGCGC 33120 

TGCTCGACGT CCTGCGCGAC GCGCCCGGCG 33180 

AGCCGACCCT GTTCACCATG ATGGTCTCCC 33240 

GGCCCGCCGC CGTGGTCGGC CACTCCCAAG 33300 

CCCTGTCCCT CGACGACGCG GCCCGCGTGA 333 60 

TGGCCGGAAA CGGCGGCATG CTCTCCGTGA 33420 

TGGAGCCCTG GGCGGAGCGG ATGTCCGTGG 33480 

TGGCCGGTGA CGCGCGGGCG CTGGAGGAGT 33540 

TGCGCTGGCC CCTCGCCGGC GTCGACTTCG 33 600 

GCGCCGAGCT CCTCGACACG CTGGGCACCG 33660 

TCTCCACCGT GACCGCCGCG GCGCACGAGC 33720 

GG AACATGCG CGAACCCGTG GAGTTCGCGT 33780 

ACCGCACCTT CGTCGAGATG GGCCCGCACC 33840 
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CCCTGCTGGG CGCCGCGATC GACGAGGTCG 
TCGCCACCCT CCACCGCGGC TCCGGCGGCC 

5 

CGTTCGCCCA CGGAGTACGG GTCGACTGGG 
GGGTCCCGCT GCCCACCTAC GCCTTCAGCC 

1Q GCCGGCGCGC CGTCGAGGCG GCCCCCGTCG 

CCTGGACACC CGTGGCATCC GACGACTCCG 
AGACCCCCGG CACCGCGCCG GACGAGGCGG 

15 GGGTGGTCGT GGAGCGCTGC CTGCTGGATC 

AGCGACTGGC CGAACTGGAC GCGCAGCCGG 
GCCGTCCGCA GAGCACCGCA CCGGCCGACG 

20 TCCTGCTCGT GGTCCAGGCC GTGCCGGACG 

CGCGGGGTGC GGTCGCGGTG GGGTCGGGTG 
GGGGTCTGGG GCGGGTGGCT GCGTTGGAGG 

25 

TGGCGGTGGG GGCGGGTGTG CGTGAGTGGC 
GTGAGGATCA GGTGGCGGTG CGTGGTGGGG 

30 GGGTGCGGGG TGGTTCGGGG GTGTGGCGTG 

TGGGTGGTGT GGGGGGTCAT GTGGCGCGGT 
TGTTGGCGGG GCGTCGGGGT GGTGGGGTTG 

35 TGGGGTTGGG GGCGAAGGTG ACGTTCGTTT 

TGGGGTTGTT GGGTGTGGTG GAGGGGTTGG 
CGGGGGTGGC TCAGGTGTCG GGGTTGGGTG 

40 

TGGGGGGTAA GGCGGTGGGG GCTGAGTTGT 
ATGCGTTCGT GTTGTTCTCG TCGGGTGCTG 
ATGCGGCGGC CAATGCGCAT CTGGATGCGT 

45 

CCGCGACCTC CGTCGCCTGG GGCCTGTGGG 
TCACGGAGTT CTACGCCGAG CGCGGCCTCG 

SO CACTGCACAC GGCACTGAAC GAGGGCGACA 

AACACTTCGT CACCGGGTTC ACCGCCTACC 
AGGTCCGCGC GTTGCGCACG CCCGAACCCA 

55 GCGTCGACGC CGCCCTCACC CCGCGCGAGC 



CCGAGGCCGA GGGCGTGCAC GCCACCGCCC 33900 

TGGACCGGTT CCGCTCCTCG GTGGGCGCCG 33960 

ACGCCCTCTT CGAGGGCTCC GGCGCCCGCC 34020 

GGGACCGGTA CTGGCTGCCC ACCGCCATCG 34080 

ACGCGTCCGC CCCCGGGCGC TACCGCGTCA 34140 

GCCGGCCCTC CGGGCGCTGG CTGCTGGTGC 34200 

ACACCGCGGC GTCGGCCCTC GGTGCGGCCG 34260 

CCACCGAGGC CGCGCGCGTC ACGCTCACCG 34320 

AGGGCCTGGC CGGCGTG CTG GTGCTGCCCG 34380 

CCTCCCCGCT CGACCCGGGG ACGGCCGCCG 34440 

CCGCTCCGAA GGCCCGGATC TGGGTGGTGA 34500 

AGGTGCCGTG TGCGGTCGGT GCGCGGGTGT 34560 

TGCCGGTGCA GTGGGGTGGG TTGGTGGATG 34620 

GTCGTCTCGT GGGTGTGGTT GCGGGGGGTG 34680 

GTGTGTTCGG TCGTCGTCTG GTGGGTGTGG 34740 

CGCGGGGGTG TGTGGTGGTG ACGGGTGGGT 34800 

GGTTGGCGCG TTCGGGTGCG GAGCATGTGG 34860 

TGGGGGCGGT GGAGTTGGAG CGGGAGTTGG 34920 

CGTGTGATGT GGGGGATCGG GCGTCGATGG 34980 

GGGTGCCGTT GCGTGGTGTG TTTCATGCGG 3 5040 

AGGTGTCGTT GGCGGAGGCG GGTGGTGTGT 35100 

TGGACGAGTT GACGGCGGGT GTGGAGCTGG 3 5160 

GGGTGTGGGG GAGTGGGGGG CAGTCGGTGT 35220 

TGGCGGAGCG TCGTCGTGCG CAGGGGCGTC 35280 

GCGGCGAGGG CATGGGAGCG GACGAAGGCG 3 5340 

CCCCCATGCG GCCCGAGTCG GGCATCGAGG 3 5400 

CCTGCGTCAC GGTCGCCGAC ATCGACTGGG 3 5460 

GGCCCAGCCC GCTGATCTCC GACATCCCCC 35520 

CCGTGGACGC CTCGGACGGA CTGCGCCGGC 3 5580 

GCACCAAGGT CCTGGTCGAC CTGGTCCGCA 35640 
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CGGTGGCGGC GGAGGTCCTC GGTCACGACG GGATCGGCGG CATCGGCCAC GACGTGGCCT 35700 

TCCGGGACCT CGGCTTCGAC TCGCTGGCOG CGGTGCGGAT GCGCGGCCGG CTGGCCGAGG 35760 

5 

CGACCGGACT CGTACTGCCC GCGACGGTCA TCTTCGACCA CCCCACCGTG GACCGGCTCG 35820 

GCGGCGCGCT GCTGGAGCGG CTGTCCGCGG ACGAACCCGC GCCCGGCGGG GCGCCGGAGC 35880 

70 CCGCCGGGGG GAGGCCCGCG ACCCCACCGC CCGCACCGGA GCCGGCCGTC CACGACGCCG 35940 

ACATCGACGA ACTGGACGCG GACGCCCTCA TCCGGCTGGC CACGGGAACC GCCGGACCGG 36000 

CCGACGGCAC GCCGGCCGAC GGCGGGCCCG ACGCGGCGGC GACCGCCCCC GACGGAGCAC 3 6060 

75 CGGAGCAGTA GCGCGCCCTC ACCGGCGCGC CGACCGGCGG AGCGCCGTAC CGCCGACGCC 36120 

CCCCACAGCC AGCGAGCAGA CGAGGAAGCC GAAGATGTCA CCGTCCATCG ACGAAGTGCT 36180 

GGGTCCGCTG CGCACCTCCG TCAAGGAGAC CGAGCGGCTG CGCCGGCACA ACCGGGAGCT 3 6240 

20 

CCTGGCCGGC GCGCACGAGC CGGTCGCCAT CGTGGGCATG GCCTGCCGCT ACCCCGGTCG 3 6300 

CGTGAGCACC CCGGACGACC TGTGGGAGCT CGCCGCGGAC GGCGTCGACG CGATCACCCC 36360 

CTTCCCGGCC GACCGGGGCT GGGACGAGGA CGCCGTCTAC TCGCCCGACC CCGACACCCC 36420 

25 

CGGCACCACC TACTGCCGTG AGGGCGGCTT CCTCACCGGC GCCGGGGACT TCGACGCGGC 36480 

CTTCTTCGGC ATCTCGCCGA ACGAGGCGCT GGTGATGGAC CCGCAGCAGC GGCTGTTGCT 36540 

30 GGAGACGTCG TGGGAGACGT TGGAGCGGGC CGGCATCGTC CCCGCGTCGC TGCGCGGCAG 36600 

CCGTACCGGT GTCTTCGTCG GAGCCGCGCA CACGGGATAC GTCACCGACA CCGCGCGAGC 36660 

GCCCGAGGGC ACCGAGGGCT ATCTGCTGAC GGGCAACGCC GATGCCGTCA TGTCCGGCCG 36720 

35 GATCGCCTAC TCCCTGGGTC TGGAGGGGCC GGCGCTGACG ATCGGGACGG CCTGCTCGTC 3 6780 

GTCGTTGGTG GCGTTGCATC TGGCGGTGCA GTCGTTGCGG CGGGGCGAGT GCGACCTCGC 3 6840 

GTTGGCCGGC GGCGTCGCGG TCATGCCCGA CCCGACGGTG TTCGTGGAGT TCTCGCGGCA 3 6900 

40 

GCGGGGGCTG GCGGTGGACG GGCGGTGCAA GGCGTTCGCG GAGGGTGCGG ACGGGACGGC 3 6960 

GTGGGCGGAG GGAGTGGGTG TGCTGCTGGT GGAGCGGCTT TCCGACGCGC GCCGCAATGG 37020 

^ CCATCGGGTG CTGGCGGTGG TGCGGGGCAG TGCGGTCAAT CAGGACGGGG CGAGCAATGG 37080 

GCTGACGGCG CCGAGTGGTC CTGCGCAGCA GCGGGTGATC CGTGAGGCGC TGGCTGATGC 37140 

GGGGCTGACG CCCGCCGACG TGGATGTGGT GGAGGCGCAC GGTACGGGGA CGGCGTTGGG 37200 

50 TGATCCGATC GAGGCGGGTG CGTTGCTGGC CACGTACGGG CGGGAGCGGG TCGGTGATCC 37260 

TTTGTGGTTG GGGTCCTTCA AGTCGAACAT CGGGCATGCG CAGGCGGCTG CGGGTGTGGG 37320 

TGGTGTGATC AAGGTGGTGC AGGCGATGCG GCATGGGTCG TTGCCGCGGA CGCTGCATGT 37380 

55 

GGATGCGCCG TCGTCGAAGG TGGAGTGGGC TTCGGGTGCG GTGGAGCTGC TGACCGAGGG 37440 
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CCGGTCGTGG CCGCGGCGGG TGGAGCGGGT 
GAGCGGGACC AACGCCCATG TGGTCCTGGA 

5 

CGGGGACGGC CCCGGACCCG ACCGGCCCGA 
CTCGGCACGC TCGCGGGAGG CGCTGCGCGG 

10 CCAGGGGCGC ACGGAGGGCA CCGGCGGCGG 

CGGATACTCC CTGGCCACCA CCAGGGACAC 
GGAGAACCGG ACGGCCGGGG AGGACCTCGC 

75 CGTGGTCACG GGTGTCGCGC GACGTGGCCG 

CGCCCAGCGG CTCGGCGCCG GTCGGGAGCT 
CCTCGACGAG ATCGCGGCGG AGTTCGACGC 

20 

GTTCGCCGAG CCCGCCACGC CGGACGCCGC 
GGCCCTCTTC GCGGTGGAGA CCGCGCTCTT 
GGACGTCCTC GTGGGCCACT CGATCGGCGG 

2S 

CTCTGCGGCC GACGCGGCCC GGCTGGTCTC 
CGAGGGCGGC GCGATGGCGG CCGTGCAGGC 

30 CGTCGCCGCC GGCGGCGCGG TGGTCGCCGC 

CGGGGACGAG GCGGCCGTAC TGGCGGCGGC 
CAAGCGCCTG AGGGTGAGCC ACGCCTTCCA 

55 CTTCCGCGCG GTGGCGGACA CGGTCGACTA 

AGTGACCGGC GACCTCGCCG ACGCCGCCCA 
GGTGCGGCAG CCGGTGCGCT TCGCCGACGC 

40 

GACCTTCATC GAGCTCGGGC CCGACGCCGT 
CGCGGAGGCC GACGTCGTGT TCGCCCCGGC 

^ CGTGCTCCGG GCCGCCGCGA GCGCGTACGT 

CTACGGCGGC ACGGGAGCCC GCCGCACCGA 
CTACTGGCTC GCCCCCGCCT CGGCCGCGGT 

50 ATCCGTGCCG GAAGCCGAGC AGGACGGGGC 

CGCCTCGGCC GCGGCGCGAC TCGGCGCCGA 
GGTCCTGCCG CACCTGGCCG CCTGGCACGA 

55 

CCTGCACTAC CGCGTCACCT GGCAGGCGAT 



GCGGCGGGCC GCGGTGTCGG CGTTCGGGGT 37500 

GGAAGCACCG GTCGAGGCCG GGAGCGAGCA 37560 

CGCCGTGACG GGTCCGCTCC CCTGGGTGCT 37620 

CCAGGCCGGA CGACTCGCCG CTCTCGCCCG 37680 

CAGCGGACTC GTCGTCCCCG CGGCCGACAT 37740 

CCTGGAGCAC CGGGCGGTGG CGCTGGTGCA 37800 

CGCGCTGGCC GCCGGCCGCA CACCGGAGAG 37860 

CGGGATCGCC TTCCTCTGCT CGGGGCAGGG 37920 

CCGCGGCAGG TTCCCCGTCT TCGCCGACGC 37980 

CCACCTCGAA CGCCCTCTCC TGTCGGTGAT 38040 

ACTCCTCGAC CGCACCGACT ACACCCAGCC 38100 

CCGGCTCCTG GAGAGCTGGG GCCTGGTCCC 38160 

TCTGGTGGCG GCTCACGTGG CGGGCGTCTT 38220 

CGCACGCGGC CGGCTCATGC GGGCCCTGCC 38280 

CACCGAGCGG GAGGCCGCCG CGCTGGAGCC 38340 

GGTCAACGGC CCGCAGGCCC TCGTGCTCTC 38400 

CGGTGAACTG GCCGCCCGCG GACGCCGCAC 38460 

CTCACCCCGT ATGGACGCCA TGCTCGCCGA 38520 

CCACGCCCCC CGGCTGCCGG TCGTCTCCGA 38580 

GCTGACCGAC CCCGGCTACT GGACCCGCCA 38640 

CGTGCGCACC GCGAGCGCCC GGGACGCCGC 38700 

CCTGTGCGGC ATGGCGGAGG AGTCCCTGGC 38760 

ACTGCGCCGC GGGCGCCCGG AGGGCGACAC 38820 

CCGCGGCGCG GGCCTCGACT GGGCCGCGCT 38880 

CCTGCCCACC TACGCCTTCC AGCACAGCCG 38940 

CGCCCCCGCG ACGGCCGCCC CCTCCGTCCG 39000 

GCTGTGGGCC GCCGTGCACG CCGGTGACGT 39060 

CGACGCCGGT ATCGAACACG AACTGCGCGC 39120 

CCGCGACCGC GCGACCGCGC GGACCGCGGG 39180 

CGAGGCAGAC GCTGTCAGGT TCAGCCCCTC 39240 
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GGATCGCTGG CTGATGGTCG AGCATGGGCA 
GGCGCTGCGC GCGGCCGGCG CGGAGGTCAC 

5 

CGGATCACCG CGGACGGAGA CCCCGGACCG 
CGCACGGAGC CCGGAGGGCC TGGCCGGCGT 

10 GGTCGCCGGG CACCCCGGGC TGGACCAGGG 

ACTGACCGAC GCCGCGGTGC GGGCACCGCT 
GGGGTCGGGT GAGGTGCCGT GTGCGGTGGG 

15 TGCGTTGGAG GTGCCGGTGC AGTGGGGTGG 

GCGTGAGTGG CGTCGTGTGG TGGGTGTGGT 
GCGTGGTGCG GGTGTGTTCG GTCGTCGTCT 

20 

GGTGTGGCGT GCGCGGGGGT GTGTGGTGGT 
TGTGGCGCGG TGGTTGGCGC GTTCGGGTGC 
TGGTGGGGTT GTGGGGGCGG TGGAGTTGGA 

25 

GACGTTCGTT TCGTGTGATG TGGGGGATCG 
GGAGGGGTTG GGGGTGCCGT TGCGTGGTGT 

30 GGGGTTGGGT GAGGTGTCGT TGGCGGAGGC 

GGCTGAGTTG TTGGACGAGT TGACGGCGGG 
GTCGGGTGCT GGGGTGTGGG GGAGTGGGGG 

35 TCTGGATGCG TTGGCGGAGC GTCGTCGTGC 

GGGCCCGTGG GACGGCGACG GCATGGGCGA 
CGGCGTGGCC CCGCTCCACC CCGAGACGGC 

40 

CGGCGAAGCC ACGGTCACCG TGGCGGACAT 
CGCCTTCCGT CCCAGCCCCC TGATCGCCGG 
CGGCCGGCCC GCCGAGGACA CCCCCACCGC 

4b 

CCGGCCGCGG CTCGCCCTGG ACCTGGTGCT 
CGAGGACGCC CGGGTCGACG CCCGGGCCCC 
50 CGCGGTGCGG CTGCGCCGCC GGCTGGCCGA 

CGTCTTCGAC CACGAGGACC CCACCGCGCT 
GGGGACCCCC GGCCCCCAGG AGGGCACGGC 

55 

CCGCGCCGCC GTCGAACAGC GCAGGTCGAG 



GCACACGGAA TGCGCGGACG CCGCGGAACG 39300 

CCGCCTGGTG TGGCCGCTGG AGCAGCACAC 39360 

CGGCACCCTG GCGGCCCGGC TGGCCGAGCT 39420 

GCTGCTGCTC CCCGACTCGG GCGGTGCCGC 39480 

AACGGCGGCG GTGCTGCTGA CGATCCAGGC 39540 

GTGGGTGGTG ACGCGGGGTG CGGTGGCGGT 39600 

TGCGCGGGTG TGGGGTCTGG GGCGGGTCGC 39660 

GTTGGTGGAT GTGGCGGTGG GGGOGGGTGT 39720 

TGCGGGGGGT GGTGAGGATC AGGTGGCGGT 39780 

GGTGGGTGTG GGGGTGCGGG GTGGTTCGGG 39840 

GACGGGTGGG TTGGGTGGTG TGGGGGGTCA 39900 

GGAGCATGTG GTGTTGGCGG GGCGTCGGGG 39960 

GCGGGAGTTG GTGGGGTTGG GGGCGAAGGT 40020 

GGCGTCGGTG GTGGGGTTGT TGGGTGTGGT 40080 

GTTTCATGCG GCGGGGGTGG CTCAGGTGTC 40140 

GGGTGGTGTG TTGGGGGGTA AGGCGGTGGG 40200 

TGTGGAGCTG GATGCGTTCG TGTTGTTCTC 40260 

GCAGTCGGTG TATGCGGCGG CCAATGCGCA 40320 

GCAGGGGCGT CCCGCGACCT CCGTCGCCTG 40380 

GATGGCGCCC GAGGGCTACT TCGCCCGCCA 40440 

GCTCACCGCC CTGCACCAGG CCATCGACGG 40500 

CGACTGGGAA CGGTTCGCCC CCGGCTTCAC 40560 

CATCCCCGCG GCCCGTACGG CGCCCGCCGC 40620 

CCCCGGCCTC CTGCGGGCGC GGCCCGAGGA 40680 

CCGCCACGTC GCGGCGGTCC TCGGCCACTC 40740 

CTTCCGGGAC CTCGGCTTCG ACTCGCTCGC 40800 

GGACACCGGG CTCGACCTGC CCGGCACCCT 40860 

GGCCCACCAC CTGGCCGGCC TCGCCGACGC 40920 

TCGGGCCGAG AGCGGGCTGT TCGCCTCCTT 40980 

CGAGGTCGTG GAGCTGATGG CCGACCTGGC 41040 
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GGCGTTCCGG CCCGCCTACT CCCGGCAGCA CCCCGGCTCC GGCCGCCCCG CGCCCGTACC 41100 

CCTCGCGACC GGACCCGCGA CGCGTCCCAC GCTCTACTGC TGCGCCGGCA CCGCGGTCOG 41160 

5 

CTCCGGGCCC GCCGAGTACG TCCCGTTCGC CGAAGGACTG CGCGGCGTCC GGGAGACGGT 41220 

CGCCCTTCCC CTGTCCGGCT TCGGCGACCC CGCGG AACCG ATGCCCGCAT OGCTCGACGC 41280 

10 GCTGATCGAG GTCCAGGCCG ACGTCCTCCT GGAGCACACC GCGGGCAAGC CCTTCGCCCT 41340 

CGCCGGCCAC TCCGCCGGCG CGAACATCGC CCACGCCCTG GCCGCCCGGC TGGAGGAACG 41400 

CGGCTCGGGC CCCGCAGCCG TCGTACTGAT GGACGTCTAC CGTCCCGAGG ACCCCGGTGC 41460 

15 GATGGGCGAG TGGCGCGACG ACCTGCTCAG CTGGGCGCTC GAACGCAGCA CGGTGCCCCT 41520 

GGAGGACCAC CGGCTCACCG CCATGGCCGG CTATCAGCGG CTGGTGCTCG GAACCCGGCT 41580 

CACCGCCCTC GAAGCCCCCG TCCTGCTGGC CCGGGCGTCC GAACCCCTGT GCGCGTGGCC 41640 

20 

GCCCGCGGGC GGGGCGCGGG GCGACTGGCG GTCCCAGGTC CCGTTCGCAC GGACCGTCGC 41700 

CGACGTGCCC GGCAACCACT TCACCATGCT CACCGAACAC GCCCGGCACA CCGCGTCCCT 41760 

GGTGCACGAA TGGCTGGACA GCCTCCCGCA CCAGCCCGGT CCCGCCCCGC TCACCGGAGG 41820 

25 

GAAACACTGA TGTACGCCGA CGACATCGCG GCCGTCTACG ACCTGGTCCA CGAGGGGAAG 41880 

GGGAAGGACT ACCGGCAGGA GGCCGAGGAG ATCGCCGCAC TCGTGCGCGT CCACCGGCCG 41940 

30 GGCGCCCGGA CCCTGCTCGA CGTGGCCTGC GGCACCGGCC AGCACCTGCA CCACCTCGAC 42000 

GGCCTCTTCG ACCACGTCGA GGGCCTGGAA CTCTCCGCCG ACATGCTGGC CCTCGCGACC 42060 

GGCCGGAACC CCGGTGTCAC CTTCCACCAA GGGGACATGC GCTCGTTCTC CCTGGGACGC 42120 

35 CGGTTCGACG CGGTGACCTG CATGTTCAGC TCCATAGGCC ACCTGCGGAC CACCGACGAA 42180 

CTCGACAGCA CGCTGCGGGC CTTCACCGAC CACCTCGAAC CGTCCGGCGT CATCGTCGTC 42240 

GAACCCTGGT GGTTCCCCGA GTCCTTCACC CCCGGTTACG TCGGCGCCAG CATCACGGAG 42300 

40 

GCGGGCGAGC GCACCGTCTG CCGGGTCTCG CACTCCGTAC GGGAGGGGAA CGCCACCCGC 42360 

ATCGAGGTGC ACTACCTCCT CGCCGGACCC GGCGGCGTCC GTCACCTGAC CGAGGACCAC 42420 

45 ACCATCACCC TGTTCCCGCG CGCCGACTAC GAGGCGGCCT TCGAGCGCGC CGGCTGCGAC 42480 

GTGGTCTACC AGGAAGGCGG CCCGTCCGGT CGCGGGCTGT TCATCGGCAC CCGCCGCTCA 42540 

CCCGGTGCCG ACGCGGACCG CCGCGGCCCG GAGGCGGGTT GCCCCGACCC ACCCGGCACA 42600 

SO CCCGGGTCCC CCGATCGTGC GAGCGCCCCC ATCGACCCGA GAAGAAAGGC AGGGCAGCCA 42660 

TGCCCACCCT TGCCACGGAA ACGGCCCCCG CGAGCACCAG CACGAGCGCG GGCACGAGCA 42720 

CGGGCGTCCG TGCGCTCGGC CGTCGGCTCC AGCTGACCCG GGCCGCACAC TGGTGCGCCG 42780 

55 

GCAACCAGGG CGACCCGTAC GCGCTGATCC TGCGCGCCGT CGCCGACCCC GAGCCGTTCG 42840 
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AACGGGAGAT CCGGGCCCGC GGACCGTCGT TCCGCAGCGA ACAGCTGGAC GCCTGGGTGA 42900 

CCGCGGACCC CGAGGTGGCG GCGGCCGTCC TGGCCGACCC GCGCTTCGGC ACCCTCGACC 42960 

GGGCCGGACG CCGCCCGGAC GAGGAACTGC TCCCCCTCGC CGAGGCGTTC CCCCACCACG 43020 

AACGCGCGGA GCTCGTACGC CTGCGGGCGC TGGCCGCCCC GGTGCTCAGC CGGTACGCCC 43080 

CGGCCCAGGC GCCCTGCGCG GCGCGCACCA CCGCCCGCAG AGTGCTCGGC CGCCTCCTGC 43140 

CCACCGGTGA CGCCGGGTTC GACCTTGTCG GCGAGGTCCC CCGGCCCTAC GCCGTCGAGC 43200 

TGATGCTCAG GCTCCTCGGA GTGCCGGGCC GCGACCGCGC CACCGCCGCG CGGGCACTCG 43260 

CCGCCTGCGG CCCCCAGCTC GACGCCCGGA TGGCCCCGCA ACTGCTGACC GTCGCCCGGG 43320 

AGTCCGCCGA CGCCGTCCGC ACACTGGCCG ACCTGGTCCC CGAGCTCGTC GCGGAGAAGT 43380 

CCCGGGGCCT CGGGAACGCC GAGCCCCGGC CCGACGACGT GCTCGCCCTC CTCCTGCACG 43440 

ACGGCGTCGC CCCCGGCGAC GTCGAGCGCA TCGCGCTGCT CCTCGCGGTC GGCGCACCCG 43500 

AACCCGTCGT CACCGCCGTC GCGCACACGG TCCACCGGCT GCTCGGCCGG CCGGGGGAGT 43560 

GGGAGAGGGC CCGCCGGACG CCGGCCGCGG CGAACGCCGT CGACCAGGTG CTGCGCGAGC 43620 

GCCCCCCGGC CCGGCTGCAG AACCGGGTCG CGCACACCGG CCTCGAACTC GGCGGCCGCC 43 680 

GGATCACCGC CGACGAGCAC GTCGTGGTGC TGGCCGCCGC CGGACGGGAG ATCCCCGGGC 43740 

CGGAGCCGCT CGGGGGCGCC GACGGACCGC ACCTGGCGCT CGCCCTCCCG CTCATCCGCC 43800 

TGGCCGCCAC CACCGCGGTC CAGGTCACGG CCGGCCGCCT GCCCGGCCTG CGGGCCGAGG 43860 

GACCGCCCCT GACCCGGCCG CGGTCACCGG TCCTGGGCGC CTGCGCCCGC CTCCGGGTCC 43920 

ACCCGGGATG ACCCCGCCGT CCGTACGCCC CCTCCCAGAC CGGAGCCGCT GTCCGCGTCC 43980 

TGCTGACATC CCTCGCCCAC AACACCCACT ACTACAGTCT GGTGCCCCTC GCCTGGGCGC 44040 

TGCGCGCCGC CGGGCACGAG GTACGGGTGG CGAGCCCGCC CTCCCTCACC GACGTCATCA 44100 

CCTCCACCGG TCTGACCGCC GTACCGGTGG GCGACGACCG ACCGGCCGCG GAGCTGCTCG 44160 

CCGAGATGGG CAGAGACCTC GTCCCCTACC AGAGGGGCTT CGAGTTCGGT GAGGTGGAGA 44220 

3 GCG AGGAGG A GACCACCTGG GAGTACCTGC TCGGCCAGCA GAGCATCATC GCCGCCCTCT 44280 

GCTTCGCCCC GTTCAACGGC GCCGCCACGA TGGACGAGAT CGTCGACTTC GCCCGTCGCT 44340 

GGCGGCCCGA CCTGGTCGTG TGGGAACCCT GGACCTA 44377 

(2) INFORMATION FOR SEQ ID NO:8: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4550 amino acids 
<B) TYPE: amino acid 
(D) TOPOLOGY: unknown 
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(ii) MOLECULE TYPE: peptide 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 

Met Ser Gly Glu Leu Ala lie Ser Arg Ser Asp Asp Arg Ser Asp Ala 
1 5 10 15 

Val Ala Val Val Gly Met Ala Cys Arg Phe Pro Gly Ala Pro Gly He 
20 25 30 

Ala Glu Phe Trp Lys Leu Leu Thr Asp Gly Arg Asp Ala He Gly Arg 
35 40 45 

Asp Ala Asp Gly Arg Arg Arg Gly Met He Glu Ala Pro Gly Asp Phe 
50 55 60 

Asp Ala Ala Phe Phe Gly Met Ser Pro Arg Glu Ala Ala Glu Thr Asp 
65 70 75 80 

Pro Gin Gin Arg Leu Met Leu Glu Leu Gly Trp Glu Ala Leu Glu Asp 
85 90 95 

Ala Gly He Val Pro Gly Ser Leu Arg Gly Glu Ala Val Gly Val Phe 
100 105 110 

Val Gly Ala Met His Asp Asp Tyr Ala Thr Leu Leu His Arg Ala Gly 
115 120 125 

Ala Pro Val Gly Pro His Thr Ala Thr Gly Leu Gin Arg Ala Met Leu 

30 130 135 140 

Ala Asn Arg Leu Ser Tyr Val Leu Gly Thr Arg Gly Pro Ser Leu Ala 

145 150 155 160 



w 



15 



20 



25 



35 



Val Asp Thr Ala Gin Ser Ser Ser Leu Val Ala Val Ala Leu Ala Val 
165 170 175 

Glu Ser Leu Arg Ala Gly Thr Ser Arg Val Ala Val Ala Gly Gly Val 
180 185 190 

Asn Leu Val Leu Ala Asp Glu Gly Thr Ala Ala Met Glu Arg Leu Gly 
40 1 9 5 2 0 0 2 0 5 

Ala Leu Ser Pro Asp Gly Arg Cys His Thr Phe Asp Ala Arg Ala Asn 
210 215 220 



45 



Gly Tyr Val Arg Gly Glu Gly Gly Ala Ala Val Val Leu Lys Pro Leu 
225 230 235 240 

Ala Asp Ala Leu Ala Asp Gly Asp Pro Val Tyr Cys Val Val Arg Gly 
245 250 255 

50 Val Ala Val Gly Asn Asp Gly Gly Gly Pro Gly Leu Thr Ala Pro Asp 

260 265 270 

Arg Glu Gly Gin Glu Ala Val Leu Arg Ala Ala Cys Ala Gin Ala Arg 
275 280 285 

55 

Val Asp Pro Ala Glu Val Arg Phe Val Glu Leu His Gly Thr Gly Thr 
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290 



295 



300 



Pro Val Gly Asp Pro Val Glu Ala His Ala Leu Gly Ala Val His Gly 
305 310 315 320 

Ser Gly Arg Pro Ala Asp Asp Pro Leu Leu Val Gly Ser Val Lys Thr 
325 330 335 

Asn He Gly His Leu Glu Gly Ala Ala Gly He Ala Gly Leu Val Lys 
340 345 350 

Ala Ala Leu Cys Leu Arg Glu Arg Thr Leu Pro Gly Ser Leu Asn Phe 
355 360 365 

Ala Thr Pro Ser Pro Ala He Pro Leu Asp Gin Leu Arg Leu Lys Val 
370 375 380 

Gin Thr Ala Ala Ala Glu Leu Pro Leu Ala Pro Gly Gly Ala Pro Leu 
385 390 395 400 

Leu Ala Gly Val Ser Ser Phe Gly He Gly Gly Thr Asn Cys His Val 
405 410 415 

Val Leu Glu His Leu Pro Ser Arg Pro Thr Pro Ala Val Ser Val Ala 
420 425 430 

Ala Ser Leu Pro Asp Val Pro Pro Leu Leu Leu Ser Ala Arg Ser Glu 
435 440 445 

Gly Ala Leu Arg Ala Gin Ala Val Arg Leu Gly Glu Tyr Val Glu Arg 
450 455 460 

Val Gly Ala Asp Pro Arg Asp Val Ala Tyr Ser Leu Ala Ser Thr Arg 
465 470 475 480 

Thr Leu Phe Glu His Arg Ala Val Val Pro Cys Gly Gly Arg Gly Glu 

485 490 * 495 

Leu Val Ala Ala Leu Gly Gly Phe Ala Ala Gly Arg Val Ser Gly Gly 

500 505 510 

Val Arg Ser Gly Arg Ala Val Pro Gly Gly Val Gly Val Leu Phe Thr 
515 520 525 

Gly Gin Gly Ala Gin Trp Val Gly Met Gly Arg Gly Leu Tyr Ala Gly 
530 535 540 

Gly Gly Val Phe Ala Glu Val Leu Asp Glu Val Leu Ser Met Val Gly 
545 550 555 560 

Glu Val Asp Gly Arg Ser Leu Arg Asp Val Met Phe Gly Asp Val Asp 
565 570 * 575 

Val Asp Ala Gly Ala Gly Ala Asp Ala Gly Ala Gly Ala Gly Ala Gly 
580 585 590 

Val Gly Ser Gly Ser Gly Ser Val Gly Gly Leu Leu Gly Arg Thr Glu 
595 600 605 



Phe Ala Gin Pro Ala Leu Phe Ala Leu Glu Val Ala Leu Phe Arg Ala 
610 615 620 
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Leu 
625 


Glu 


Ala 


Arg 


Gly 


Val 
630 


Glu 


Val 


Ser Val Val 
635 


Leu 


Gly 


His 


Ser 


Val 
640 


Gly 


Glu 


Val 


Ala 


Ala 
645 


Ala 


Tyr 


Val 


Ala Gly Val 
650 
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Ser 
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Gly 
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Asp 


Ala 


Val 


Arg 
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660 


Val 
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Ala 


Arg 


Gly Gly Leu 
665 


Met 


Gly 


Gly 
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Val 


Gly 


Gly 
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Gly 


Met 
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Ser 


Val 
680 


Gly Ala Ser 


Glu 
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Val 
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Gly 


Val 
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Val 


Glu 
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Gly 
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Glu 


Trp Val Ser 


Val 
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Val 


Asn 


Gly 
705 


Pro 


Arg 


Ser 


Val 


Val 
710 
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Ser 


Gly Asp Val 
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Gly 


Val 
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Glu 


Ser 
720 


Val 


Val 


Ala 


Ser 


Leu 
725 


Met 


Gly 


Asp 


Gly Val Glu 
730 


Cys 


Arg 


Arg 


Leu 
735 


Asp 


Val 


Ser 


His 


Gly 
740 


Phe 


His 


Ser 


Val 


Leu Met Glu 
745 


Pro 


Val 


Leu 
750 


Gly 


Glu 


Phe 


Arg 


Gly 
755 


Val 


Val 


Glu 


Ser 


Leu 
760 


Glu Phe Gly 


Arg 


Val 
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Arg 


Pro 


Gly 


Val 


Val 
770 


Val 


Val 


Ser 


Gly 


Val 
775 


Ser 


Gly Gly Val 


Val 
780 


Gly 


Ser 


Gly 


Glu 


Leu 
785 


Gly 


Asp 


Pro 


Gly 


Tyr 
790 


Trp 


Val 


Arg His Ala 
795 


Arg 


Glu 


Ala 


Val 


Arg 
800 


Phe 


Ala 


Asp 


Gly 


Val 
805 


Gly 


Val 


Val 


Arg Gly Leu 
810 


Gly 


Val 


Gly 


Thr 
815 


Leu 


Val 


Glu 


Val 


Gly 
820 


Pro 


His 


Gly 


Val 


Leu Thr Gly 

825 


Met 


Ala 


Gly 
830 


Glu 


Cys 


Leu 
Arg 


Gly 

Ala 
850 


Ala 
835 
Glu 


Gly 
Arg 


Asp 
Glu 


Asp 
Val 


Val 

Phe 
855 


Val 
840 
Glu 


Val Val Pro 
Ala Ala Leu 


Ala 

Ala 
860 


Met 
845 
Thr 


Arg 
Val 


Arg 
Phe 


Gly 
Thr 


Arg 
865 


Asp 


Ala 


Gly 


Leu 


Asp 
870 


Ala 


Thr 


Ala Leu His 
875 


Thr 


Gly 


Ser 


Thr 


Gly 
880 


Arg 


Arg 


He 


Asp 


Leu 
885 


Pro 


Thr 
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Pro Phe Gin 
890 


Arg 


Arg 


Thr 


His 
895 
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Pro 
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900 


Ser 
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905 
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910 
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Thr 
915 


Asp 
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Val 


Gly 


His 
920 


Ser Val Ser 
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925 
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Ser 
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Ser 


His 
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935 
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Ser 
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Ser Pro Glu Pro Met Ser Pro Glu Asp Ala Val Arg Leu Val Arg Glu 
945 950 955 960 

Ser Thr Ala Ala Val Leu Gly His Asp Asp Pro Gly Glu Val Ala Leu 
965 970 975 

Asp Arg Thr Phe Thr Ser Gin Gly Met Asp Ser Val Thr Ala Val Glu 
980 985 990 

Leu Cys Asp Leu Leu Lys Gly Ala Ser Gly Leu Pro Leu Ala Ala Thr 
995 1000 1005 

Leu Val Tyr Asp Leu Pro Thr Pro Arg Ala Val Ala Glu His lie Val 
1010 1015 1020 

Glu Ala Ala Gly Gly Pro Lys Asp Ser Val Ala Gly Gly Pro Gly Val 
1025 1030 1035 1040 

Leu Ser Ser Ala Ala Val Gly Val Ser Asp Ala Arg Gly Gly Ser Arg 
1045 1050 1055 

Asp Asp Asp Asp Pro lie Ala He Val Gly Val Gly Cys Arg Leu Pro 
1060 1065 3 1070 

Gly Gly Val Asp Ser Arg Ala Ala Leu Trp Glu Leu Leu Glu Ser Gly 
1075 1080 1085 

Ala Asp Ala lie Ser Ser Phe Pro Thr Asp Arg Gly Trp Asp Leu Asp 
1090 1095 1100 

Gly Leu Tyr Asp Pro Glu Pro Gly Thr Pro Gly Lys Thr Tyr Val Arg 
H05 1110 H15 H20 

Glu Gly Gly Phe Leu His Ser Ala Ala Glu Phe Asp Ala Glu Phe Phe 
1125 1130 1135 

Gly He Ser Pro Arg Glu Ala Thr Ala Met Asp Pro Gin Gin Arg Leu 
1140 H45 1150 

Leu Leu Glu Ala Ser Trp Glu Ala Leu Glu Asp Ala Gly Val Leu Pro 
1155 H60 H65 

Glu Ser Leu Arg Gly Gly Asp Ala Gly Val Phe Val Gly Ala Thr Ala 
1170 H75 H80 

Pro Glu Tyr Gly Pro Arg Leu His Glu Gly Ala Asp Gly Tyr Glu Gly 
H85 1190 H95 ^ 1200 

Tyr Leu Leu Thr Gly Thr Thr Ala Ser Val Ala Ser Gly Arg lie Ala 
1205 1210 1215 

Tyr Thr Leu Gly Thr Gly Gly Pro Ala Leu Thr Val Asp Thr Ala Cys 
1220 1225 1230 

Ser Ser Ser Leu Val Ala Leu His Leu Ala Val Gin Ala Leu Arg Arg 
1235 1240 1245 

Gly Glu Cys Gly Leu Ala Leu Ala Gly Gly Ala Thr Val Met Ser Gly 
1250 1255 1260 

Pro Gly Met Phe Val Glu Phe Ser Arg Gin Arg Gly Leu Ala Pro Asp 
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1265 1270 1275 1280 

Gly Arg Cys Met Pro Phe Ser Ala Asp Ala Asp Gly Thr Ala Trp Ser 
1285 1290 1295 

Glu Gly Val Ala Val Leu Ala Leu Glu Arg Leu Ser Asp Ala Arg Arg 
1300 1305 1310 

Ala Gly His Arg Val Leu Gly Val Val Arg Gly Ser Ala Val Asn Gin 
1315 1320 1325 

Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn Arg Ser Ala Gin Glu 
1330 1335 1340 

Gly Val He Arg Ala Ala Leu Ala Asp Ala Gly Leu Ala Pro Gly Asp 
1345 1350 1355 1360 

Val Asp Ala Val Glu Ala His Gly Thr Gly Thr Ala Leu Gly Asp Pro 
1365 1370 1375 

He Glu Ala Ser Ala Leu Leu Ala Thr Tyr Gly Arg Glu Arg Val Gly 
1380 1385 ~ 1390 

Asp Pro Leu Trp Leu Gly Ser Leu Lys Ser Asn Val Gly His Thr Gin 
1395 1400 1405 

Ala Ala Ala Gly Ala Ala Gly Val Val Lys Met Leu Leu Ala Leu Glu 
1410 1415 1420 

His Gly Thr Leu Pro Arg Thr Leu His Ala Asp Arg Pro Ser Thr His 
1425 1430 1435 1440 

Val Asp Trp Ser Ser Gly Thr Val Ala Leu Leu Ala Glu Ala Arg Arg 
1445 1450 1455 

Trp Pro Arg Arg Ser Asp Arg Pro Arg Arg Ala Ala Val Ser Ser Phe 
1460 1465 1470 

Gly He Ser Gly Thr Asn Ala His Leu He lie Glu Glu Ala Pro Glu 
1475 1480 1485 

Trp Val Glu Asp He Asp Gly Val Ala Ala Pro Asp Arg Gly Thr Ala 
1490 1495 1500 

Asp Ala Ala Ala Pro Ser Pro Leu Leu Leu Ser Ala Arg Ser Glu Gly 
1505 1510 1515 " 1520 

Ala Leu Arg Ala Gin Ala Val Arg Leu Gly Glu Tyr Val Glu Arg Val 
1525 1530 1535 

Gly Ala Asp Pro Arg Asp Val Ala Tyr Ser Leu Ala Ser Thr Arg Thr 

1540 1545 1550 

Leu Phe Glu His Arg Ala Val Val Pro Cys Gly Gly Arg Gly Glu Leu 
1555 1560 1565 

Val Ala Ala Leu Gly Gly Phe Ala Ala Gly Arg Val Ser Gly Gly Val 
1570 1575 1580 

Arg Ser Gly Arg Ala Val Pro Gly Gly Val Gly Val Leu Phe Thr Gly 
1585 1590 1595 1600 
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Gin Gly Ala Gin Trp Val Gly Met Gly Arg Gly Leu Tyr Ala Gly Gly 
1605 1610 1615 

Gly Val Phe Ala Glu Val Leu Asp Glu Val Leu Ser Met Val Gly Glu 
1620 1625 1630 

Val Asp Gly Arg Ser Leu Arg Asp Val Met Phe Gly Asp Val Asp Val 
1635 1640 * 1645 

Asp Ala Gly Ala Gly Ala Asp Ala Gly Ala Gly Ala Gly Ala Gly Val 
1650 1655 1660 

Gly Ser Gly Ser Gly Ser Val Gly Gly Leu Leu Gly Arg Thr Glu Phe 
1665 1670 1675 1680 

Ala Gin Pro Ala Leu Phe Ala Leu Glu Val Ala Leu Phe Arg Ala Leu 
1685 1690 1695 

Glu Ala Arg Gly Val Glu Val Ser Val Val Leu Gly His Ser Val Gly 
1700 1705 1710 

Glu Val Ala Ala Ala Tyr Val Ala Gly Val Leu Ser Leu Gly Asp Ala 
1715 1720 1725 

Val Arg Leu Val Val Ala Arg Gly Gly Leu Met Gly Gly Leu Pro Val 
1730 1735 1740 

Gly Gly Gly Met Trp Ser Val Gly Ala Ser Glu Ser Val Val Arg Gly 
1745 1750 1755 1760 

Val Val Glu Gly Leu Gly Glu Trp Val Ser Val Ala Ala Val Asn Gly 
1765 1770 1775 

Pro Arg Ser Val Val Leu Ser Gly Asp Val Gly Val Leu Glu Ser Val 
1780 1785 1790 

Val Ala Ser Leu Met Gly Asp Gly Val Glu Cys Arg Arg Leu Asp Val 
1795 1800 1805 

Ser His Gly Phe His Ser Val Leu Met Glu Pro Val Leu Gly Glu Phe 
1810 1815 1820 

Arg Gly Val Val Glu Ser Leu Glu Phe Gly Arg Val Arg Pro Gly Val 
1825 1830 1835 1840 

Val Val Val Ser Gly Val Ser Gly Gly Val Val Gly Ser Gly Glu Leu 
1845 1850 1855 

Gly Asp Pro Gly Tyr Trp Val Arg His Ala Arg Glu Ala Val Arg Phe 
1860 1865 1870 

Ala Asp Gly Val Gly Val Val Arg Gly Leu Gly Val Gly Thr Leu Val 
1875 1880 1885 

Glu Val Gly Pro His Gly Val Leu Thr Gly Met Ala Gly Glu Cys Leu 

1890 1895 1900 

Gly Ala Gly Asp Asp Val Val Val Val Pro Ala Met Arg Arg Gly Arg 

1905 1910 1915 1920 
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Ala Glu Arg Glu Val Phe Glu Ala Ala Leu Ala Thr Val Phe Thr Arg 
1925 1930 1935 

Asp Ala Gly Leu Asp Ala Thr Ala Leu His Thr Gly Ser Thr Gly Arg 
1940 1945 1950 

Arg lie Asp Leu Pro Thr Tyr Pro Phe Gin Arg Asp Arg Tyr Trp Leu 
1955 1960 1965 

Asp Pro Val Arg Thr Ala Val Thr Gly Val Glu Pro Ala Gly Ser Pro 
1970 1975 1980 

Ala Asp Ala Arg Ala Thr Glu Arg Gly Arg Ser Thr Thr Ala Gly He 
1985 1990 1995 2000 

Arg Tyr Arg Val Ala Trp Gin Pro Ala Val Val Asp Arg Gly Asn Pro 
2005 2010 2015 

Gly Pro Ala Gly His Val Leu Leu Leu Ala Pro Asp Glu Asp Thr Ala 
2020 2025 2030 

Asp Ser Gly Leu Ala Pro Ala He Ala Arg Glu Leu Ala Val Arg Gly 
2035 2040 2045 

Ala Glu Val His Thr Val Ala Val Pro Val Gly Thr Gly Arg Glu Ala 
2050 2055 2060 

Ala Gly Asp Leu Leu Arg Ala Ala Gly Asp Gly Ala Ala Arg Ser Thr 
2065 2070 2075 2080 

Arg Val Leu Trp Leu Ala Pro Ala Glu Pro Asp Ala Ala Asp Ala Val 
2085 2090 2095 

Ala Leu Val Gin Ala Leu Gly Glu Ala Val Pro Glu Ala Pro Leu Trp 
2100 2105 2110 

He Thr Thr Arg Glu Ala Ala Ala Val Arg Pro Asp Glu Thr Pro Ser 
2115 2120 2125 

Val Gly Gly Ala Gin Leu Trp Gly Leu Gly Gin Val Ala Ala Leu Glu 
2130 2135 2140 

Leu Gly Arg Arg Trp Gly Gly Leu Ala Asp Leu Pro Gly Ser Ala Ser 
2145 2150 2155 2160 

Pro Ala Val Leu Arg Thr Phe Val Gly Ala Leu Leu Ala Gly Gly Glu 
2165 2170 2175 

Asn Gin Phe Ala Val Arg Pro Ser Gly Val His Val Arg Arg Val Val 
2180 2185 2190 

Pro Ala Pro Val Pro Val Pro Ala Ser Ala Arg Thr Val Thr Thr Ala 
2195 2200 2205 

Pro Ala Thr Ala Val Gly Glu Asp Ala Arg Asn Asp Thr Ser Asp Val 
2210 2215 2220 

Val Val Pro Asp Asp Arg Trp Ser Ser Gly Thr Val Leu He Thr Gly 
2225 2230 2235 2240 
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Gly Thr Gly Ala Leu Gly Ala Gin Val Ala Arg Arg Leu Ala Arg Ser 

2245 2250 2255 

Gly Ala Ala Arg Leu Leu Leu Val Gly Arg Arg Gly Ala Ala Gly Pro 
2260 2265 2270 

Gly Val Gly Glu Leu Val Glu Glu Leu Thr Ala Leu Gly Ser Glu Val 
2275 2280 2285 

Ala Val Glu Ala Cys Asp Val Ala Asp Arg Asp Ala Leu Ala Ala Leu 
2290 2295 2300 

Leu Ala Gly Leu Pro Glu Glu Arg Pro Leu Val Ala Val Leu His Ala 
2305 2310 2315 2320 

Ala Gly Val Leu Asp Asp Gly Val Leu Asp Ser Leu Thr Ser Asp Arg 
2325 2330 2335 

Val Asp Ala Val Leu Arg Asp Lys Val Thr Ala Ala Arg His Leu Asp 
2340 2345 2350 

Glu Leu Thr Ala Asp Leu Pro Leu Asp Ala Phe Val Leu Phe Ser Ser 
2355 2360 2365 

He Val Gly Val Trp Gly Asn Gly Gly Gin Ala Val Tyr Ala Ala Ala 
2370 2375 2380 

Asn Ala Ala Leu Asp Ala Leu Ala Gin Arg Arg Arg Ala Arg Gly Ala 
2385 2390 2395 2400 

Arg Ala Ala Ser He Ala Trp Gly Pro Trp Ala Gly Ala Gly Met Ala 
2405 2410 2415 

Ser Gly Thr Ala Ala Lys Ser Phe Glu Arg Asp Gly Val Thr Ala Leu 
2420 2425 2430 

Asp Pro Glu Arg Ala Leu Asp Val Leu Asp Asp Val Val Gly Ala Gly 
2435 2440 2445 

Gly Thr Ser Ala Ala Gly Thr His Ala Ala Gly Glu Ser Ser Leu Leu 
2450 2455 2460 

Val Ala Asp Val Asp Trp Glu Thr Phe Val Gly Arg Ser Val Thr Arg 
2465 2470 2475 2480 

Arg Thr Trp Ser Leu Phe Asp Gly Val Ser Ala Ala Arg Ser Ala Arg 
2485 2490 2495 

Ala Gly His Ala Ala Asp Asp Arg Ala Ala Leu Thr Pro Gly Thr Arg 
2500 2505 2510 

Pro Gly Asp Gly Ala Pro Gly Gly Ser Gly Gin Asp Gly Gly Glu Gly 
2515 2520 2525 

Arg Pro Trp Leu Ser Val Gly Pro Ser Pro Ala Glu Arg Arg Arg Ala 
2530 2535 2540 

Leu Leu Thr Leu Val Arg Ser Glu Ala Ala Gly He Leu Arg His Ala 
2545 2550 2555 2560 

Ser Ala Asp Ala Val Asp Pro Glu Leu Ala Phe Arg Ser Ala Gly Phe 
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2565 2570 2575 

Asp Ser Leu Thr Val Leu Glu Leu Arg Asn Arg Leu Thr Ala Ala Thr 
2580 2585 2590 

Gly Leu Asn Leu Pro Asn Thr Leu Leu Phe Asp His Pro Thr Pro Leu 

2595 2600 2605 

Ser Leu Ala Ser His Leu His Asp Glu Leu Phe Gly Pro Asp Ser Glu 
2610 2615 2620 

Ala Glu Pro Ala Ala Ala Ala Pro Thr Pro Val Met Ala Asp Glu Arg 
2625 2630 2635 2640 

Glu Pro He Ala He Val Gly Met Ala Cys Arg Tyr Pro Gly Gly Val 
2645 2650 2655 

Ala Ser Pro Asp Asp Leu Trp Asp Leu Val Ala Gly Asp Gly His Thr 
2660 2665 2670 

Leu Ser Pro Phe Pro Ala Asp Arg Gly Trp Asp Val Glu Gly Leu Tyr 
2675 2680 2685 

Asp Pro Glu Pro Gly Val Pro Gly Lys Ser Tyr Val Arg Glu Gly Gly 
2690 2695 2700 

Phe Leu Arg Ser Ala Ala Glu Phe Asp Ala Glu Phe Phe Gly He Ser 
2705 2710 2715 2720 

Pro Arg Glu Ala Thr Ala Met Asp Pro Gin Gin Arg Leu Leu Leu Glu 
2725 2730 2735 

Thr Ser Trp Glu Ala Leu Glu Arg Ala Gly He Val Pro Asp Ser Leu 
2740 2745 2750 

Arg Gly Thr Arg Thr Gly Val Phe Ser Gly lie Ser Gin Gin Asp Tyr 
2755 2760 2765 

Ala Thr Gin Leu Gly Asp Ala Ala Asp Thr Tyr Gly Gly His Val Leu 
2770 2775 2780 

Thr Gly Thr Leu Gly Ser Val He Ser Gly Arg Val Ala Tyr Ala Leu 
2785 2790 2795 2800 

Gly Leu Glu Gly Pro Ala Leu Thr Val Asp Thr Ala Cys Ser Ser Ser 
2805 2810 2815 

Leu Val Ala Leu His Leu Ala Val Gin Ser Leu Arg Arg Gly Glu Cys 
2820 2825 2830 

Asp Leu Ala Leu Ala Gly Gly Val Thr Val Met Ala Thr Pro Thr Val 
2835 2840 2845 

Phe Val Glu Phe Ser Arg Gin Arg Gly Leu Ala Ala Asp Gly Arg Cys 
2850 2855 2860 

Lys Ala Phe Ala Glu Gly Ala Asp Gly Thr Ala Trp Ala Glu Gly Val 
2865 2870 2875 2880 

Gly Val Leu Leu Val Glu Arg Leu Ser Asp Ala Arg Arg Asn Gly His. 

2885 2890 2895 



143 



EP 0 791 655 A2 



Arg Val Leu Ala Val Val Arg Gly Ser Ala Val Asn Gin Asp Gly Ala 
2900 2905 2910 

Ser Asn Gly Leu Thr Ala Pro Ser Gly Pro Ala Gin Gin Arg Val He 
2915 2920 2925 

Arg Glu Ala Leu Ala Asp Ala Gly Leu Val Pro Ala Asp Val Asp Val 
2930 2935 2940 

Val Glu Ala His Gly Thr Gly Thr Ala Leu Gly Asp Pro He Glu Ala 
2945 2950 2955 2960 

Gly Ala Leu Leu Ala Thr Tyr Gly Arg Glu Arg Val Gly Asp Pro Leu 
2965 2970 2975 

Trp Leu Gly Ser Leu Lys Ser Asn He Gly His Ala Gin Ala Ala Ala 
2980 2985 2990 

Gly Val Gly Gly Val He Lys Val Val Gin Gly Met Arg His Gly Ser 
2995 3000 3005 

Leu Pro Arg Thr Leu His Val Asp Ala Pro Ser Ser Lys Val Glu Trp 
3010 3015 3020 

Ala Ser Gly Ala Val Glu Leu Leu Thr Glu Thr Arg Ser Trp Pro Arg 
3025 3030 3035 3040 

Arg Val Glu Arg Val Arg Arg Ala Ala Val Ser Ala Phe Gly Val Ser 
3045 3050 3055 

Gly Thr Asn Ala His Val Val Leu Glu Glu Ala Pro Ala Glu Ala Gly 
3060 3065 3070 

Ser Glu His Gly Asp Gly Pro Glu Pro Glu Arg Pro Asp Ala Val Thr 
3075 3080 3085 

Gly Pro Leu Ser Trp Val Leu Ser Ala Arg Ser Glu Gly Ala Leu Arg 
3090 3095 3100 

Ala Gin Ala Val Arg Leu Arg Glu Cys Val Glu Arg Val Gly Ala Asp 
3105 3110 3115 3120 

Pro Arg Asp Val Ala Gly Ser Leu Val Val Ser Arg Ala Ser Phe Gly 
3125 3130 3135 

Glu Arg Ala Val Val Val Gly Arg Gly Arg Glu Glu Leu Leu Ala Gly 
3140 3145 3150 

Leu Asp Val Val Ala Ala Gly Ala Pro Val Gly Val Ser Ser Gly Ala 
3155 3160 3165 

Gly Ala Val Val Arg Gly Ser Ala Val Arg Gly Arg Gly Val Gly Val 
3170 3175 3180 

Leu Phe Thr Gly Gin Gly Ala Gin Trp Val Gly Met Gly Arg Gly Leu 
3185 3190 3195 3200 

Tyr Ala Gly Gly Gly Val Phe Ala Glu Val Leu Asp Glu Val Leu Ser 
3205 3210 3215 
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Val Val Gly Glu Val Asp Gly Arg Ser Leu Arg Asp Val Met Phe Ala 
3220 3225 3230 

Asp Ala Asp Ser Val Leu Gly Gly Leu Leu Gly Arg Thr Glu Phe Ala 
3235 3240 3245 

Gin Pro Ala Leu Phe Ala Leu Glu Val Ala Leu Phe Arg Ala Leu Glu 
3250 3255 3260 

Ala Arg Gly Val Glu Val Ser Val Val Leu Gly His Ser Val Gly Glu 
3265 3270 3275 3280 

Val Ala Ala Ala Tyr Val Ala Gly Val Leu Ser Leu Gly Asp Ala Val 
3285 3290 3295 

Arg Leu Val Val Ala Arg Gly Gly Leu Met Gly Gly Leu Pro Val Gly 

3300 3305 3310 

Gly Gly Met Trp Ser Val Gly Ala Ser Glu Ser Val Val Arg Gly Val 
3315 3320 3325 

Val Glu Gly Leu Gly Glu Trp Val Ser Val Ala Ala Val Asn Gly Pro 
3330 3335 3340 

Arg Ser Val Val Leu Ser Gly Asp Val Gly Val Leu Glu Ser Val Val 
3345 3350 3355 3360 

Val Thr Leu Met Gly Asp Gly Val Glu Cys Arg Arg Leu Asp Val Ser 
3365 3370 3375 

His Gly Phe His Ser Val Leu Met Glu Pro Val Leu Gly Glu Phe Arg 
3380 3385 3390 

Gly Val Val Glu Ser Leu Glu Phe Gly Arg Val Arg Pro Gly Val Val 
3395 3400 3405 

Val Val Ser Gly Val Ser Gly Gly Val Val Gly Ser Gly Glu Leu Gly 
3410 3415 3420 

Asp Pro Gly Tyr Trp Val Arg His Ala Arg Glu Ala Val Arg Phe Ala 
3425 3430 3435 3440 

Asp Gly Val Gly Val Val Arg Gly Leu Gly Val Gly Thr Leu Val Glu 
3445 3450 3455 

Val Gly Pro His Gly Val Leu Thr Gly Met Ala Gly Gin Cys Leu Glu 
3460 3465 3470 

Ala Gly Asp Asp Val Val Val Val Pro Ala Met Arg Arg Gly Arg Pro 
3475 3480 3485 

Glu Arg Glu Val Phe Glu Ala Ala Leu Ala Thr Val Phe Thr Arg Asp 
3490 3495 3500 

Ala Gly Leu Asp Ala Thr Thr Leu His Thr Gly Ser Thr Gly Arg Arg 
3505 3510 3515 3520 

He Asp Leu Pro Thr Tyr Pro Phe Gin His Asn Arg Tyr Trp Ala Thr 
3525 3530 3535 

Gly Ser Val Thr Gly Ala Thr Gly Thr Ser Ala Ala Ala Arg Phe Gly 
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3540 3545 3550 

Leu Glu Trp Lys Asp His Pro Phe Leu Ser Gly Ala Thr Pro lie Ala 
3555 3560 3565 

Gly Ser Gly Ala Leu Leu Leu Thr Gly Arg Val Gly Leu Ala Ala His 
3570 3575 3580 

Pro Trp Leu Ala Asp His Ala lie Ser Gly Thr Val Leu Leu Pro Gly 
3585 3590 3595 3600 

Thr Ala He Ala Asp Leu Leu Leu Arg Ala Val Glu Glu Val Gly Ala 
3605 3610 3615 

Gly Gly Val Glu Glu Leu Thr Leu His Glu Pro Leu Leu Leu Pro Glu 
3620 3625 3630 

Arg Gly Gly Leu His Val Gin Val Leu Val Glu Ala Ala Asp Glu Gin 
3635 3640 3645 

Gly Arg Arg Ala Val Ala Val Ala Ala Arg Pro Glu Gly Pro Gly Arg 

3650 3655 3660 

Asp Gly Glu Glu Gin Glu Trp Thr Arg His Ala Glu Gly Val Leu Thr 
3665 3670 3675 3680 

Ser Thr Glu Thr Ala Val Pro Asp Met Gly Trp Ala Ala Gly Ala Trp 
3685 3690 3695 

Pro Pro Pro Gly Ala Glu Pro He Asp Val Glu Glu Leu Tyr Asp Ala 
3700 3705 3710 

Phe Ala Ala Asp Gly Tyr Gly Tyr Gly Pro Ala Phe Thr Ala Leu Ser 
3715 3720 3725 

Gly Val Trp Arg Leu Gly Asp Glu Leu Phe Ala Glu Val Arg Arg Pro 
3730 3735 3740 

Ala Gly Gly Ala Gly Thr Thr Gly Asp Gly Phe Gly Val His Pro Ala 
3745 3750 3755 3760 

Leu Phe Asp Ala Ala Leu His Pro Trp Arg Ala Gly Gly Leu Leu Pro 
3765 3770 " 3775 

Asp Thr Gly Gly Thr Thr Trp Ala Pro Phe Ser Trp Gin Gly lie Ala 
3780 3785 3790 

Leu His Thr Thr Gly Ala Glu Thr Leu Arg Val Arg Leu Ala Pro Ala 
3795 3800 3805 

Ala Gly Gly Thr Glu Ser Ala Phe Ser Val Gin Ala Ala Asp Pro Ala 
3810 3815 3820 

Gly Thr Pro Val Leu Thr Leu Asp Ala Leu Leu Leu Arg Pro Val Thr 
3825 3830 3835 3840 

Leu Gly Arg Ala Asp Ala Pro Gin Pro Leu Tyr Arg Val Asp Trp Gin 
3845 3850 3855 

Pro Val Gly Gin Gly Thr Glu Ala Ser Gly Ala Gin Gly Trp Thr Val 
3860 3865 3870 
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Leu Gly Gin Ala Ala Ala Glu Thr Val Ala Gin Pro Ala Ala His Ala 
3875 3880 3885 

Asp Leu Thr Ala Leu Arg Thr Ala Val Ala Ala Ala Gly Thr Pro Val 
3890 3895 3900 

Pro Arg Leu Val Val Val Ser Pro Val Asp Thr Arg Leu Asp Glu Gly 
3905 3910 3915 3920 

Pro Val Leu Ala Asp Ala Glu Ala Arg Ala Arg Ala Gly Asp Gly Trp 
3925 3930 3935 

Asp Asp Asp Pro Leu Arg Val Ala Leu Gly Arg Gly Leu Thr Leu Val 
3940 3945 3950 

Arg Glu Trp Val Glu Asp Glu Arg Leu Ala Asp Ser Arg Leu Val Val 
3955 3960 3965 

Leu Thr Arg Gly Ala Val Ala Ala Gly Pro Gly Asp Val Pro Asp Leu 
3970 3975 3980 

Thr Gly Ala Ala Leu Trp Gly Leu Leu Arg Ser Ala Gin Ser Glu Tyr 
3985 3990 3995 4000 

Pro Asp Arg Phe Thr Leu He Asp Val Asp Asp Ser Pro Glu Ser Arg 

4005 4010 4015 

Ala Ala Leu Pro Arg Ala Leu Gly Ser Ala Glu Arg Gin Leu Ala Leu 
4020 4025 4030 

Arg Thr Gly Asp Val Leu Ala Pro Ala Leu Val Pro Met Ala Thr Arg 
4035 4040 4045 

Pro Ala Glu Thr Thr Pro Ala Thr Ala Val Ala Ser Ala Thr Thr Gin 
4050 4055 4060 

Thr Gin Val Thr Ala Pro Ala Pro Asp Asp Pro Ala Ala Asp Ala Val 
4065 4070 4075 4080 

Phe Asp Pro Ala Gly Thr Val Leu He Thr Gly Gly Thr Gly Ala Leu 
4085 4090 4095 

Gly Arg Arg Val Ala Ser His Leu Ala Arg Arg Tyr Gly Val Arg His 
4100 4105 4110 

Met Leu Leu Val Ser Arg Arg Gly Pro Asp Ala Pro Glu Ala Gly Pro 
4115 4120 4125 

Leu Glu Arg Glu Leu Ala Gly Leu Gly Val Thr Ala Thr Phe Leu Ala 
4130 4135 4140 

Cys Asp Leu Thr Asp He Glu Ala Val Arg Lys Ala Val Ala Ala Val 
4145 * 4150 4155 4160 

Pro Ser Asp His Pro Leu Thr Gly Val Val His Thr Ala Gly Val Leu 
4165 4170 4175 

Asp Asp Gly Ala Leu Thr Gly Leu Thr Arg Gin Arg Leu Asp Thr Val 
4180 4185 4190 
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Leu Arg Pro Lys Ala Asp Ala Val Arg Asn Leu His Glu Ala Thr Leu 
4195 4200 4205 

Asp Arg Pro Leu Arg Ala Phe Val Leu Phe Ser Ala Ala Ala Gly Leu 
4210 4215 4220 

Leu Gly Arg Pro Gly Gin Ala Ser Tyr Ala Ala Ala Asn Ala Val Leu 
4225 4230 4235 4240 



Asp Ala Leu Ala Gly Ala Arg Arg Ala Ala Gly Leu Pro Ala Val Ser 
4245 4250 4255 



Leu 


Ala 


Trp 


Gly 


Leu 


Trp 


Asp 


Glu 


Gin 


Thr 


Gly 


Met 


Ala 


Gly 


Gly 


Leu 








4260 








4265 








4270 




Asp 


Glu 


Met 


Ala 


Leu 


Arg 


Val 


Leu 


Arg 


Arg 


Asp 


Gly 


He 


Ala 


Ala 


Met 






4275 








4280 








4285 






Pro 


Pro 


Glu 


Gin 


Gly 


Leu 


Glu 


Leu 


Leu 


Asp 


Leu 


Ala 


Leu 


Thr 


Gly 


His 




4290 








4295 








4300 








Arg 


Asp 


Gly 


Pro 


Ala 


Val 


Leu 


Val 


Pro 


Leu 


Leu 


Leu 


Asp 


Gly 


Ala 


Ala 


4305 








4310 








4315 








432C 


Leu 


Arg 


Arg 


Thr 


Ala 


Lys 


Glu 


Arg 


Gly 


Ala 


Ala 


Thr 


Met 


Ser 


Pro 


Leu 










4325 








4330 








4335 


Leu 


Arg 


Ala 


Leu 


Leu 


Pro 


Ala 


Ala 


Leu 


Arg 


Arg 


Ser 


Gly 


Gly 


Ala 


Gly 








4340 








4345 








4350 




Ala 


Pro 


Ala 


Ala 


Ala 


Asp 


Arg 


His 


Gly 


Lys 


Glu 


Ala 


Asp 


Pro 


Gly 


Ala 






4355 








4360 








4365 






Gly 


Arg 


Leu 


Ala 


Gly 


Met 


Val 


Ala 


Leu 


Glu 


Ala 


Ala 


Glu 


Arg 


Ser 


Ala 




4370 








4375 








4380 








Ala 


Val 


Leu 


Glu 


Leu 


Val 


Thr 


Glu 


Gin 


Val 


Ala 


Glu 


Val 


Leu 


Gly 


Tyr 


4385 








4390 








4395 








440C 


Ala 


Ser 


Ala 


Ala 


Glu 


He 


Glu 


Pro 


Glu 


Arg 


Pro 


Phe 


Arg 


Glu 


He 


Gly 










4405 








4410 








4415 


Val 


Asp 


Ser 


Leu 


Ala 


Ala 


Val 


Glu 


Leu 


Arg 


Asn 


Arg 


Leu 


Ser 


Arg 


Leu 








4420 








4425 








4430 




Val 


Gly 


Leu 


Arg 


Leu 


Pro 


Thr 


Thr 


Leu 


Ser 


Phe 


Asp 


His 


Pro 


Thr 


Pro 






4435 








4440 








4445 






Lys 


Asp 


Met 


Ala 


Gin 


His 


He 


Asp 


Gly 


Gin 


Leu 


Pro 


Arg 


Pro 


Ala 


Gly 




4450 








4455 








4460 









Ala Ser Pro Ala Asp Ala Ala Leu Glu Gly He Gly Asp Leu Ala Arg 
4465 4470 4475 4480 



Ala Val Ala Leu Leu Gly Thr Gly Asp Ala Arg Arg Ala Glu Val Arg 
4485 * 4490 4495 

Glu Gin Leu Val Gly Leu Leu Ala Ala Leu Asp Pro Pro Gly Arg Thr 
4500 4505 4510 

Gly Thr Ala Ala Pro Gly Val Pro Ser Gly Ala Asp Gly Ala Glu Pro 
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4515 4520 4525 

Thr Val Thr Asp Arg Leu Asp Glu Ala Thr Asp Asp Glu lie Phe Ala 
4530 4535 4540 

Phe Leu Asp Glu Gin Leu 
4545 4550 



10 (2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1996 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

15 

(ii) MOLECULE TYPE: peptide 



20 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 

Met Thr Ala Glu Asn Asp Lys lie Arg Ser Tyr Leu Lys Arg Ala Thr 
15 10 15 

Ala Glu Leu His Arg Thr Lys Ser Arg Leu Ala Glu Val Glu Ser Ala 
20 25 30 

Ser Arg Glu Pro He Ala He Val Gly Met Ala Cys Arg Tyr Pro Gly 
35 40 45 

Gly Val Ala Ser Pro Asp Asp Leu Trp Asp Leu Val Ala Ala Gly Thr 
30 5 0 5 5 6 0 

Asp Ala Val Ser Ala Phe Pro Val Asp Arg Gly Trp Asp Val Glu Gly 
65 70 75 80 

Leu Tyr Asp Pro Asp Pro Glu Ala Val Gly Arg Ser Tyr Val Arg Glu 
85 90 95 

35 

Gly Gly Phe Leu His Ser Ala Ala Glu Phe Asp Ala Glu Phe Phe Gly 
100 105 110 

He Ser Pro Arg Glu Ala Ala Ala Met Asp Pro Gin Gin Arg Leu Leu 
40 115 120 125 

Leu Glu Thr Ser Trp Glu Ala Leu Glu Arg Ala Gly He Val Pro Ala 
130 135 140 

4S Ser Leu Arg Gly Thr Arg Thr Gly Val Phe Thr Gly Val Met Tyr Asp 

145 150 155 160 

Asp Tyr Gly Ser Arg Phe Asp Ser Ala Pro Pro Glu Tyr Glu Gly Tyr 
165 170 175 



50 



55 



Leu Val Asn Gly Ser Ala Gly Ser He Ala Ser Gly Arg Val Ala Tyr 
180 185 190 

Ala Leu Gly Leu Glu Gly Pro Ala Leu Thr Val Asp Thr Ala Cys Ser 
195 200 205 

Ser Ser Leu Val Ala Leu His Leu Ala Val Gin Ser Leu Arg Arg Gly 
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210 



215 



220 



Glu Cys Asp Leu Ala Leu Ala Gly Gly Val Thr Val Met Ala Thr Pro 
225 230 235 240 

Thr Val Leu Val Glu Phe Ser Arg Gin Arg Oly Leu Ala Ala Asp Gly 
245 250 255 

Arg Cys Lys Ala Phe Ala Glu Gly Ala Asp Gly Thr Ala Trp Ala Glu 
260 265 270 

Gly Val Gly Val Leu Leu Val Glu Arg Leu Ser Asp Ala Arg Arg Asn 
275 280 285 

Gly His Arg Val Leu Ala Val Val Arg Gly Ser Ala Val Asn Gin Asp 
290 295 300 

Gly Ala Ser Asn Gly Leu Thr Ala Pro Ser Gly Pro Ala Gin Gin Arg 
305 310 315 320 

Val lie Arg Glu Ala Leu Ala Asp Ala Gly Leu Thr Pro Ala Asp Val 
325 330 335 

Asp Ala Val Glu Ala His Gly Thr Gly Thr Pro Leu Gly Asp Pro lie 
340 345 350 

Glu Ala Gly Ala Leu Leu Ala Thr Tyr Gly Ser Glu Arg Gin Gly Gin 
355 360 365 

Gly Pro Leu Trp Leu Gly Ser Leu Lys Ser Asn lie Gly His Ala Gin 
370 375 380 

Ala Ala Ala Gly Val Gly Gly Val He Lys Val Val Gin Ala Met Arg 
385 390 395 400 

His Gly Ser Leu Pro Arg Thr Leu His Val Asp Ala Pro Ser Ser Lys 
405 410 415 

Val. Glu Trp Ala Ser Gly Ala Val Glu Leu Leu Thr Glu Thr Arg Ser 

420 425 430 

Trp Pro Arg Arg Val Glu Arg Val Arg Arg Ala Ala Val Ser Ala Phe 
435 440 445 

Gly Val Ser Gly Thr Asn Ala His Val Val Leu Glu Glu Ala Pro Ala 
450 455 460 

Glu Ala Gly Ser Glu His Gly Asp Gly Pro Glu Pro Glu Arg Pro Asp 
465 470 475 480 

Ala Val Thr Gly Pro Leu Ser Trp Val Leu Ser Ala Arg Ser Glu Gly 
485 490 495 

Ala Leu Arg Ala Gin Ala Val Arg Leu Arg Glu Cys Val Glu Arg Val 
500 505 510 

Gly Ala Asp Pro Arg Asp Val Ala Gly Ser Leu Val Val Ser Arg Ala 
515 520 525 



Ser Phe Gly Glu Arg Ala Val Val Val Gly Arg Gly Arg Glu Glu Leu 
530 535 540 
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Leu Ala Gly Leu Asp Val Val Ala Ala Gly Ala Pro Val Gly Val Ser 
545 550 555 560 

Gly Gly Val Ser Ser Gly Ala Gly Ala Val Val Arg Gly Ser Ala Val 
565 570 575 

Arg Gly Arg Gly Val Gly Val Leu Phe Thr Gly Gin Gly Ala Gin Trp 
580 585 590 

Val Gly Met Gly Arg Gly Leu Tyr Ala Gly Gly Gly Val Phe Ala Glu 
595 600 605 

Val Leu Asp Glu Val Leu Ser Val Val Gly Glu Val Gly Gly Trp Ser 
610 615 620 

Leu Arg Asp Val Met Phe Gly Asp Val Asp Val Asp Ala Gly Ala Gly 
625 630 635 640 

Ala Asp Ala Gly Val Gly Ser Gly Val Gly Val Gly Gly Leu Leu Gly 
645 650 655 

Arg Thr Glu Phe Ala Gin Pro Ala Leu Phe Ala Leu Glu Val Ala Leu 
660 665 670 

Phe Arg Ala Leu Glu Ala Arg Gly Val Glu Val Ser Val Val Leu Gly 
675 680 685 

His Ser Val Gly Glu Val Ala Ala Ala Tyr Val Ala Gly Val Leu Ser 
690 695 700 

Leu Gly Asp Ala Val Arg Leu Val Val Ala Arg Gly Gly Leu Met Gly 
705 710 715 720 

Gly Leu Pro Val Gly Gly Gly Met Trp Ser Val Gly Ala Ser Glu Ser 
725 730 735 

Val Val Arg Gly Val Val Glu Gly Leu Gly Glu Trp Val Ser Val Ala 
740 745 750 

Ala Val Asn Gly Pro Arg Ser Val Val Leu Ser Gly Asp Val Gly Val 
755 760 765 

Leu Glu Ser Val Val Ala Ser Leu Met Gly Asp Gly Val Glu Cys Arg 

770 775 780 

Arg Leu Asp Val Ser His Gly Phe His Ser Val Leu Met Glu Pro Val 

785 790 795 800 

Leu Gly Glu Phe Arg Gly Val Val Glu Ser Leu Glu Phe Gly Arg Val 
805 810 815 

Arg Pro Gly Val Val Val Val Ser Ser Val Ser Gly Gly Val Val Gly 
820 825 830 

Ser Gly Glu Leu Gly Asp Pro Gly Tyr Trp Val Arg His Ala Arg Glu 
835 840 845 



Ala Val Arg Phe Ala Asp Gly Val Gly Val Val Arg Gly Leu Gly Val 
850 855 860 
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Gly Thr Leu Val Glu Val Gly Pro His Gly Val Leu Thr Gly Met Ala 
865 870 875 880 

Gly Glu Cys Leu Gly Ala Gly Asp Asp Val Val Val Val Pro Ala Met 
5 885 890 895 

Arg Arg Gly Arg Ala Glu Arg Glu Val Phe Glu Ala Ala Leu Ala Thr 
900 905 910 

70 v al Phe Thr Arg Asp Ala Gly Leu Asp Ala Thr Thr Leu His Thr Gly 

915 920 925 

Ser Thr Gly Arg Arg He Asp Leu Pro Thr Tyr Pro Phe Gin His Asp 
930 935 940 

76 Arg Tyr Trp Leu Ala Ala Pro Ser Arg Pro Arg Thr Asp Gly Leu Ser 

945 950 955 960 

Ala Ala Gly Leu Arg Glu Val Glu His Pro Leu Leu Thr Ala Ala Val 
965 970 975 

Glu Leu Pro Gly Thr Asp Thr Glu Val Trp Thr Gly Arg He Ser Ala 
980 985 " 990 

Ala Asp Leu Pro Trp Leu Ala Asp His Leu Val Trp Asp Arg Gly Val 
995 1000 1005 

Val Pro Gly Thr Ala Leu Leu Glu Thr Val Leu Gin Val Gly Ser Arg 
1010 1015 1020 

He Gly Leu Pro Arg Val Ala Glu Leu Val Leu Glu Thr Pro Leu Thr 
30 1025 1030 1035 1040 

Trp Thr Ser Asp Arg Pro Leu Gin Val Arg He Val Val Thr Ala Ala 
1045 1050 1055 



20 



25 



35 



Ala Thr Ala Pro Gly Gly Ala Arg Glu Leu Thr Leu His Ser Arg Pro 
1060 1065 1070 

Glu Pro Val Ala Ala Ser Ser Ser Ser Pro Ser Pro Ala Ser Pro Arg 
1075 1080 1085 

40 His Leu Thr Ala Gin Glu Ser Asp Asp Asp Trp Thr Arg His Ala Ser 

1090 1095 1100 

Gly Leu Leu Ala Pro Ala Ala Gly Leu Ala Asp Asp Phe Ala Glu Leu 
1105 IHO 1115 1120 



45 



50 



55 



Thr Gly Ala Trp Pro Pro Val Gly Ala Glu Pro Leu Asp Leu Ala Gly 

H25 1130 1135 

Gin Tyr Pro Leu Phe Ala Ala Ala Gly Val Arg Tyr Glu Gly Ala Phe 
1140 H45 H50 

Arg Gly Leu Arg Ala Ala Trp Arg Arg Gly Asp Glu Val Phe Ala Asp 
1155 H60 H65 

Val Arg Leu Pro Asp Ala His Ala Val Asp Ala Asp Arg Tyr Glv Val 
1170 H75 H80 

His Pro Ala Leu Leu Asp Ala Val Leu His Pro He Ala Ser Leu Asp 
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1185 1190 1195 1200 

Pro Leu Gly Asp Gly Gly His Gly Leu Leu Pro Phe Ser Trp Thr Asp 
1205 1210 1215 

5 

Val Gin Gly His Gly Ala Gly Gly His Ala Leu Arg Val Arg Val Ala 

1220 1225 1230 

Ala Val Asp Gly Gly Ala Val Ser Val Thr Ala Ala Asp His Ala Gly 
to l 23 ^ 1240 1245 

Asn Pro Val Leu Ser Ala Arg Ser Leu Ala Leu Arg Arg He Thr Ala 
1250 1255 1260 

Asp Arg Leu Pro Ala Ala Pro Val Ala Pro Leu Tyr Arg Val Asp Trp 
15 1265 1270 1275 1280 

Leu Pro Phe Pro Gly Pro Val Pro Val Ser Ala Gly Gly Arg Trp Ala 
1285 1290 1295 

20 Vai Val G ly Pro Giu Ala Glu Ala Thr Ala Ala Gly Leu Arg Ala Val 

1300 1305 1310 

Gly Leu Asp Val Arg Thr His Ala Leu Pro Leu Gly Glu Pro Leu Pro * 
1315 1320 1325 

25 Pro Gin Ala Gly Thr Asp Ala Glu Val He lie Leu Asp Leu Thr Thr 

1330 1335 1340 

Thr Ala Ala Gly Arg Thr Ala Ser Asp Gly Gly Arg Leu Ser Leu Leu 
1345 1350 1355 1360 

Asp Glu Val Arg Ala Thr Val Arg Arg Thr Leu Glu Ala Val Gin Ala 
1365 1370 1375 

Arg Leu Ala Asp Thr Glu Thr Ala Pro Asp Val Asp Val Arg Thr Ala 
1380 1385 1390 

Ala Arg Pro Arg Thr Ala Ala Arg Thr Ser Pro Arg Val Asp Thr Arg 
1395 1400 1405 

Thr Gly Ala Arg Thr Ala Asp Gly Pro Arg Leu Val Val Leu Thr Arg 
40 1410 1415 142 o 

Gly Ala Ala Gly Pro Glu Gly Gly Ala Ala Asp Pro Ala Gly Ala Ala 
1425 1430 1435 " 1440 



30 



35 



Val Trp Gly Leu Val Arg Val Ala Gin Ala Glu Gin Pro Gly Arg Phe 
1445 1450 1455 

Thr Leu Val Asp Val Asp Gly Thr Gin Ala Ser Leu Arg Ala Leu Pro 
1460 1465 1470 

Gly Leu Leu Ala Thr Asp Ala Gly Gin Ser Ala Val Arg Asp Gly Arg 

1475 1480 1485 

Val Thr Val Pro Arg Leu Val Pro Val Ala Asp Pro Val Pro His Gly 
1490 1495 1500 

ss Gly Gly Thr Ala Ala Asp Gly Thr Gly Ala Gly Glu Pro Ser Ala Thr 

I 505 1510 1515 1520 



45 



50 
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Leu Asp Pro Glu Gly Thr Val Leu lie Thr Gly Gly Thr Gly Ala Leu 
1525 1530 1535 

Ala Ala Glu Thr Ala Arg His Leu Val Asp Arg His Lys Val Arg His 
1540 1545 1550 

Leu Leu Leu Val Gly Arg Arg Gly Pro Asp Ala Pro Gly Val Asp Arg 
1555 1560 1565 

Leu Val Ala Glu Leu Thr Glu Ser Gly Ala Glu Val Ala Val Arg Ala 
1570 1575 1580 

Cys Asp Val Thr Asp Arg Asp Ala Leu Arg Arg Leu Leu Asp Ala Leu 
1585 1590 1595 1600 

Pro Asp Glu His Pro Leu Thr Cys Val Val His Thr Ala Gly Val Leu 
1605 1610 1615 

Asp Asp Gly Val Leu Ser Ala Gin Thr Ala Glu Arg lie Asp Thr Val 
1620 1625 1630 

Leu Arg Pro Lys Ala Asp Ala Ala Val His Leu Asp Glu Leu Thr Arg 
1635 1640* 1645 

Glu He Gly Arg Val Pro Leu Val Leu Tyr Ser Ser Val Ser Ala Thr 
1650 1655 1660 

Leu Gly Ser Ala Gly Gin Ala Gly Tyr Ala Ala Ala Asn Ala Phe Met 
1665 1670 1675 1680 

Asp Ala Leu Ala Ala Arg Arg Cys Ala Ala Gly His Pro Ala Leu Ser 
1685 1690 1695 

Leu Gly Trp Gly Trp Trp Ser Gly Val Gly Leu Ala Thr Gly Leu Asp 
1700 1705 1710 

Gly Ala Asp Ala Ala Arg Val Arg Arg Ser Gly Leu Ala Pro Leu Asp 
1715 1720 1725 

Ala Gly Ala Ala Leu Asp Leu Leu Asp Arg Ala Leu Thr Arg Pro Glu 
1730 1735 1740 

Pro Ala Leu Leu Pro Val Arg Leu Asp Leu Arg Ala Ala Ala Gly Ala 
1745 1750 1755 1760 

Thr Ala Leu Pro Glu Val Leu Arg Asp Leu Ala Gly Val Pro Ala Asp 
1765 1770 1775 

Ala Arg Ser Thr Pro Gly Ala Ala Ala Gly Thr Gly Asp Glu Asp Gly 
1780 1785 * 1790 

Ala Val Arg Pro Ala Pro Ala Pro Ala Asp Ala Ala Gly Thr Leu Ala 
1795 1800 1805 

Ala Arg Leu Ala Gly Arg Ser Ala Pro Glu Arg Thr Ala Leu Leu Leu 
1810 1815 1820 

Asp Leu Val Arg Thr Glu Val Ala Ala Val Leu Gly His Gly Asp Pro 
1825 1830 1835 * 1840 
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Ala Ala He Gly Ala Ala Arg Thr Phe Lys Asp Ala Gly Phe Asp Ser 
1845 1850 1855 

Leu Thr Ala Val Asp Leu Arg Asn Arg Leu Asn Thr Arg Thr Gly Leu 
1860 1865 1870 

Arg Leu Pro Ala Thr Leu Val Phe Asp His Pro Thr Pro Leu Ala Leu 
1875 1880 1885 

Ala Glu Leu Leu Leu Asp Gly Leu Glu Ala Ala Gly Pro Ala Glu Pro 
1890 1895 1900 

Ala Ala Glu Val Pro Asp Glu Ala Ala Gly Ala Glu Thr Leu Ser Gly 
1905 1910 1915 1920 

Val He Asp Arg Leu Glu Arg Ser Leu Ala Ala Thr Asp Asp Gly Asp 
1925 1930 1935 

Ala Arg Val Arg Ala Ala Arg Arg Leu Arg Gly Leu Leu Asp Ala Leu 
1940 1945 1950 

Pro Ala Gly Pro Gly Ala Ala Ser Gly Pro Asp Ala Gly Glu His Ala 
1955 1960 1965 

Pro Gly Arg Gly Asp Val Val He Asp Arg Leu Arg Ser Ala Ser Asp 
1970 1975 1980 

Asp Asp Leu Phe Asp Leu Leu Asp Ser Asp Phe Gin 
1985 1990 * 1995 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3724 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Ser Ala Thr Asn Glu Glu Lys Leu Arg Glu Tyr Leu Arg Arg Ala 
15 10 15 

Met Ala Asp Leu His Ser Ala Arg Glu Arg Leu Arg Glu Val Glu Ser 
20 25 30 

Ala Ser Arg Glu Pro He Ala He Val Gly Met Ala Cys Arg Tyr Pro 
35 40 45 

Gly Gly Val Ala Ser Pro Glu Glu Leu Trp Asp Leu Val Ala Ala Gly 

50 55 60 

Thr Asp Ala He Ser Pro Phe Pro Val Asp Arg Gly Trp Asp Ala Glu 
65 70 75 80 

Gly Leu Tyr Asp Pro Glu Pro Gly Val Pro Gly Lys Ser Tyr Val Arg 
85 90 95 
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Glu Gly Gly Phe Leu His Ser Ala Ala Glu Phe Asp Ala Glu Phe Phe 

100 105 110 

Gly He Ser Pro Arg Glu Ala Ala Ala Met Asp Pro Gin Gin Arg Leu 
115 120 125 

Leu Leu Glu Thr Ser Trp Glu Ala Leu Glu Arg Ala Gly He Val Pro 
130 135 140 



10 



Ala Ser Leu Arg Gly Thr Arg Thr Gly Val Phe Thr Gly Val Met Tyr 
145 150 155 160 



His Asp Tyr Gly Ser His Gin Val Gly Thr Ala Ala Asp Pro Ser Gly 
165 170 175 



15 



Gin Leu Gly Leu Gly Thr Ala Gly Ser Val Ala Ser Gly Arg Val Ala 
180 185 190 



20 



Tyr Thr Leu Gly Leu Gin Gly Pro Ala Val Thr Met Asp Thr Ala Cys 
195 200 205 

Ser Ser Ser Leu Val Ala Leu His Leu Ala Val Gin Ser Leu Arg Arg 
210 215 220 



25 



Gly Glu Cys Asp Leu Ala Leu Ala Gly Gly Ala Thr Val Leu Ala Thr 
225 230 235 240 

Pro Thr Val Phe Val Glu Phe Ser Arg Gin Arg Gly Leu Ala Ala Asp 
245 250 255 



30 



Gly Arg Cys Lys Ala Phe Ala Glu Gly Ala Asp Gly Thr Ala Trp Ala 
260 265 270 



Glu Gly Ala Gly Val Leu Leu Val Glu Arg Leu Ser Asp Ala Arg Arg 
275 280 285 



35 



Asn Gly His Arg Val Leu Ala Val Val Arg Gly Ser Ala Val Asn Gin 
290 295 300 



Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Ser Gly Pro Ala Gin Gin 
305 310 315 * 320 



40 



Arg Val He Arg Asp Ala Leu Ala Asp Ala Gly Leu Thr Pro Ala Asp 
325 330 335 



45 



Val Asp Ala Val Glu Ala His Gly Thr Gly Thr Pro Leu Gly Asp Pro 
340 345 350 

He Glu Ala Gly Ala Leu Met Ala Thr Tyr Gly Ser Glu Arg Val Gly 
355 360 ' * 365 



50 



Asp Pro Leu Trp Leu Gly Ser Leu Lys Ser Asn He Gly His Thr Gin 
370 375 380 

Ala Ala Ala Gly Ala Ala Gly Val He Lys Met Val Gin Ala Leu Arg 
385 390 395 400 



55 



Gin Ser Glu Leu Pro Arg Thr Leu His Val Asp Ala Pro Ser Ala Lys 
405 410 415 
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Val Glu Trp Asp Ala Gly Ala Val Gin Leu Leu Thr Gly Val Arg Pro 
420 425 430 

Trp Pro Arg Arg Glu His Arg Pro Arg Arg Ala Ala Val Ser Ala Phe 
435 440 445 

Gly Val Ser Gly Thr Asn Ala His Val He He Glu Glu Pro Pro Ala 

450 455 460 

Ala Gly Asp Thr Ser Pro Ala Gly Asp Thr Pro Glu Pro Gly Glu Ala 

465 470 475 480 

Thr Ala Ser Pro Ser Thr Ala Ala Gly Pro Ser Ser Pro Ser Ala Val 
485 490 495 

Ala Gly Pro Leu Ser Pro Ser Ser Pro Ala Val Val Trp Pro Leu Ser 
500 505 510 

Ala Glu Thr Ala Pro Ala Leu Arg Ala Gin Ala Ala Arg Leu Arg Ala 
515 520 525 

His Leu Glu Arg Leu Pro Gly Thr Ser Pro Thr Asp He Gly His Ala 
530 535 540 

Leu Ala Ala Glu Arg Ala Ala Leu Thr Arg Arg Val Val Leu Leu Gly 
545 550 555 560 

Asp Asp Gly Ala Pro Val Asp Ala Leu Ala Ala Leu Ala Ala Gly Glu 
565 570 575 

Thr Thr Pro Asp Ala Val His Gly Thr Ala Ala Asp He Arg Arg Val 
580 585 590 

Ala Phe Val Phe Pro Gly Gin Gly Ser Gin Trp Ala Gly Met Gly Ala 
595 600 605 

Glu Leu Leu Asp Thr Ala Pro Ala Phe Ala Ala Glu Leu Asp Arg Cys 
610 615 620 

Gin Gly Ala Leu Ser Pro Tyr Val Asp Trp Asn Leu Ala Asp Val Leu 
625 630 635 640 

Arg Gly Ala Pro Ala Ala Pro Gly Leu Asp Arg Val Asp Val Val Gin 
645 650 655 

Pro Ala Thr Phe Ala Val Met Val Gly Leu Ala Ala Leu Trp Arg Ser 
660 665 670 

Leu Gly Val Glu Pro Ala Ala Val He Gly His Ser Gin Gly Glu He 
675 680 685 

Ala Ala Ala Cys Val Ala Gly Ala Leu Ser Leu Glu Asp Ala Ala Arg 
690 695 700 



He Val Ala Leu Arg Ser Gin Val He Ala Arg Glu Leu Ala Gly Arg 
705 710 715 720 

Gly Gly Met Ala Ser Val Ala Leu Pro Ala Ala Glu Val Glu Ala Arg 
725 730 735 

Leu Ala Gly Gly Val Glu He Ala Ala Val Asn Gly Pro Gly Ser Thr 
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740 745 750 

Val Val Cys Gly Glu Pro Gly Ala Leu Glu Ala Leu Leu Val Thr Leu 
755 760 765 

Glu Ser Glu Gly Thr Arg Val Arg Arg He Asp Val Asp Tyr Ala Ser 
770 775 780 

His Ser His Tyr Val Glu Ser He Arg Ala Glu Leu Ala Thr Val Leu 
785 790 795 800 

Gly Pro Val Arg Pro Arg Arg Gly Asp Val Pro Phe Tyr Ser Thr Val 

805 810 815 

Glu Ala Ala Leu Leu Asp Thr Ala Thr Leu Asp Ala Asp Tyr Trp Tyr 

820 825 830 

Arg Asn Leu Arg Leu Pro Val Arg Phe Glu Pro Thr Val Arg Ala Met 
835 840 845 

Leu Asp Asp Gly Val Asp Ala Phe Val Glu Cys Ser Ala His Pro Val 
850 855 860 

Leu Thr Val Gly Val Arg Gin Thr Val Glu Ser Ala Gly Gly Ala Val 
8 *5 870 875 880 

Pro Ala Leu Ala Ser Leu Arg Arg Asp Glu Gly Gly Leu Arg Arg Phe 
885 890 895 

Leu Thr Ser Ala Ala Glu Ala Gin Val Val Gly Val Pro Val Asp Trp 
900 905 910 

Ala Thr Leu Arg Pro Gly Ala Gly Arg Val Asp Leu Pro Thr Tyr Ala 
915 920 925 

Phe Gin Arg Glu Arg His Trp Val Gly Pro Ala Arg Pro Asp Ser Ala 
930 935 940 

Ala Thr Ala Ala Thr Thr Gly Asp Asp Ala Pro Glu Pro Gly Asp Arg 
945 950 955 960 

Leu Gly Tyr His Val Ala Trp Lys Gly Leu Arg Ser Thr Thr Gly Gly 
965 970 975 

Trp Arg Pro Gly Leu Arg Leu Leu He Val Pro Thr Gly Asp Gin Tyr 
980 985 990 

Thr Ala Leu Ala Asp Thr Leu Glu Gin Ala Val Ala Ser Phe Gly Gly 
995 1000 1005 

Thr Val Arg Arg Val Ala Phe Asp Pro Ala Arg Thr Gly Arg Ala Glu 
1010 1015 1020 

Leu Phe Gly Leu Leu Glu Thr Glu He Asn Gly Asp Thr Ala Val Thr 
1025 1030 1035 1040 

Gly Val Val Ser Leu Leu Gly Leu Cys Thr Asp Gly Arg Pro Asp His 
1045 1050 1055 

Pro Ala Val Pro Val Ala Val Thr Ala Thr Leu Ala Leu Val Gin Ala 
1060 1065 1070 
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Leu Ala Asp Leu Gly Ser Thr Ala Pro Leu Tip Thr Val Thr Cys Gly 
1075 1080 1085 

Ala Val Ala Thr Ala Pro Asp Glu Leu Pro Cys Thr Ala Gly Ala Gin 
1090 1095 1100 

Leu Trp Gly Leu Gly Arg Val Ala Ala Leu Glu Leu Pro Glu Val Trp 
1105 mo ins n20 

Gly Gly Leu lie Asp Leu Pro Ala Arg Pro Asp Ala Arg Val Leu Asp 
H25 H30 H35 

Arg Leu Ala Gly Val Leu Ala Glu Pro Gly Gly Glu Asp Gin He Ala 
1140 H45 H50 

Val Arg Met Ala Gly Val Phe Gly Arg Arg Val Leu Arg Asn Pro Ala 

1155 H60 H65 

Asp Ser Arg Pro Pro Ala Trp Arg Ala Arg Gly Thr Val Leu He Ala 
H70 1175 H80 

Gly Asp Leu Thr Thr Val Pro Gly Arg Leu Val Arg Ser Leu Leu Glu 
1185 H90 H95 1200 

Asp Gly Ala Asp Arg Val Val Leu Ala Gly Pro Asp Ala Pro Ala Gin 
1205 1210 1215 

Ala Ala Ala Ala Gly Leu Thr Gly Val Ser Leu Val Pro Val Arg Cys 
1220 1225 1230 

Asp Val Thr Asp Arg Ala Ala Leu Ala Ala Leu Leu Asp Glu His Ala 
1235 1240 1245 

Pro Thr Val Ala Val His Ala Pro Pro Leu Val Pro Leu Ala Pro Leu 
1250 1255 1260 

Arg Glu Thr Ala Pro Gly Asp lie Ala Ala Ala Leu Ala Ala Lys Thr 
1265 1270 1275 1280 

Thr Ala Ala Gly His Leu Val Asp Leu Ala Pro Ala Ala Gly Leu Asp 
1285 1290 1295 

Ala Leu Val Leu Phe Ser Ser Val Ser Gly Val Trp Gly Gly Ala Ala 
1300 1305 1310 

Gin Gly Gly Tyr Ala Ala Ala Ser Ala His Leu Asp Ala Leu Ala Glu 
1315 1320 1325 

Arg Ala Arg Ala Ala Gly Val Pro Ala Phe Ser Val Ala Trp Ser Pro 
1330 1335 1340 

Trp Ala Gly Gly Thr Pro Ala Asp Gly Ala Glu Ala Glu Phe Leu Ser 
1345 1350 1355 1360 

Arg Arg Gly Leu Ala Pro Leu Asp Pro Asp Gin Ala Val Arg Thr Leu 
1365 1370 1375 

Arg Arg Met Leu Glu Arg Gly Ser Ala Cys Gly Ala Val Ala Asp Val 
1380 1385 1390 
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Glu Trp Ser Arg Phe Ala Ala Ser Tyr Thr Trp Val Arg Pro Ala Val 
1395 1400 1405 

Leu Phe Asp Asp He Pro Asp Val Gin Arg Leu Arg Ala Ala Glu Leu 
1410 1415 1420 

Ala Pro Ser Thr Gly Asp Ser Thr Thr Ser Glu Leu Val Arg Glu Leu 
1425 1430 1435 1440 

Thr Ala Gin Ser Gly His Lys Arg His Ala Thr Leu Leu Arg Leu Val 
1445 1450 1455 

Arg Ala His Ala Ala Ala Val Leu Gly Gin Ser Ser Gly Asp Ala Val 
1460 1465 1470 

Ser Ser Ala Arg Ala Phe Arg Asp Leu Gly Phe Asp Ser Leu Thr Ala 
1475 1480 1485 

Leu Glu Leu Arg Asp Arg Leu Ser Thr Ser Thr Gly Leu Lys Leu Pro 
1490 1495 1500 

Thr Ser Leu Val Phe Asp His Ser Ser Pro Ala Ala Leu Ala Arg His 
1505 1510 1515 1520 

Leu Gly Glu Glu Leu Leu Gly Arg Asn Asp Thr Ala Asp Arg Ala Gly 
1525 1530 1535 

Pro Asp Thr Pro Val Arg Thr Asp Glu Pro He Ala He He Gly Met 
1540 1545 1550 

Ala Cys Arg Leu Pro Gly Gly Val Gin Ser Pro Glu Asp Leu Trp Asp 
1555 1560 1565 

Leu Leu Thr Gly Gly Thr Asp Ala He Thr Pro Phe Pro Thr Asn Arg 
1570 1575 1580 

Gly Trp Asp Asn Glu Thr Leu Tyr Asp Pro Asp Pro Asp Ser Pro Gly 
1585 1590 1595 * 1600 

His His Thr Tyr Val Arg Glu Gly Gly Phe Leu His Asp Ala Ala Glu 
1605 1610 1615 

Phe Asp Pro Gly Phe Phe Gly He Ser Pro Arg Glu Ala Leu Ala Met 
1620 1625 1630 

Asp Pro Gin Gin Arg Leu He Leu Glu Thr Ser Trp Glu Ser Phe Glu 
1635 1640 1645 

Arg Ala Gly He Asp Pro Val Glu Leu Arg Gly Ser Arg Thr Gly Val 
1650 1655 1660 

Phe Val Gly Thr Asn Gly Gin His Tyr Val Pro Leu Leu Gin Asp Gly 
1665 1670 1675 * 1680 

Asp Glu Asn Phe Asp Gly Tyr He Ala Thr Gly Asn Ser Ala Ser Val 
1685 1690 1695 

Met Ser Gly Arg Leu Ser Tyr Val Phe Gly Leu Glu Gly Pro Ala Val 
1700 1705 1710 

Thr Val Asp Thr Ala Cys Ser Ala Ser Leu Ala Ala Leu His Leu Ala 
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1715 1720 1725 

Val Gin Ser Leu Arg Arg Gly Glu Cys Asp Tyr Ala Leu Ala Gly Gly 
1730 1735 1740 

Ala Thr Val Met Ser Thr Pro Glu Met Leu Val Glu Phe Ala Arg Gin 
1745 1750 1755 1760 

Arg Ala Val Ser Pro Asp Gly Arg Ser Lys Ala Phe Ala Glu Ala Ala 
1765 1770 1775 

Asp Gly Val Gly Leu Ala Glu Gly Ala Gly Met Leu Leu Val Glu Arg 
1780 1785 1790 

Leu Ser Glu Ala Gin Lys Lys Gly His Pro Val Leu Ala Val Val Arg 
1795 1800 1805 

Gly Ser Ala Val Asn Gin Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro 
1810 1815 1820 

Ser Gly Pro Ala Gin Gin Arg Val lie Arg Glu Ala Leu Ala Asp Ala 
1825 1830 1835 1840 

Gly Leu Thr Pro Ala Asp Val Asp Ala Val Glu Ala His Gly Thr Gly 
1845 1850 1855 

Thr Pro Leu Gly Asp Pro lie Glu Ala Gly Ala Leu Leu Ala Thr Tyr 

1860 1865 1870 

Gly Arg Asp Arg Arg Asp Gly Pro Leu Trp Leu Gly Ser Leu Lys Ser 
1875 1880 1885 

Asn lie Gly His Thr Gin Ala Ala Ala Gly Val Ala Gly Val lie Lys 
1890 1895 1900 

Met Val Leu Ala Leu Arg His Gly Glu Leu Pro Arg Thr Leu His Ala 
1905 1910 1915 1920 

Ser Thr Ala Ser Ser Arg He Asp Trp Asp Ala Gly Ala Val Glu Leu 
1925 1930 1935 

Leu Asp Glu Ala Arg Pro Trp Leu Gin Arg Ala Glu Gly Pro Arg Arg 
1940 1945 1950 

Ala Gly He Ser Ser Phe Gly He Ser Gly Thr Asn Ala His Leu Val 
1955 1960 1965 

He Glu Glu Pro Pro Glu Pro Thr Ala Pro Glu Leu Leu Ala Pro Glu 
1970 1975 1980 

Pro Ala Ala Asp Gly Asp Val Trp Ser Glu Glu Trp Trp His Glu Val 
1985 1990 1995 2000 

Thr Val Pro Leu Met Met Ser Ala His Asn Glu Ala Ala Leu Arg Asp 
2005 2010 2015 

Gin Ala Arg Arg Leu Arg Ala Asp Leu Leu Ala His Pro Glu Leu His 
2020 2025 2030 

Pro Ala Asp Val Gly Tyr Thr Leu He Thr Thr Arg Thr Arg Phe Glu 
2035 2040 2045 
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Gin Arg Ala Ala Val Val Gly Glu Asn Phe Thr Glu Leu He Ala Ala 
2050 2055 2060 

Leu Asp Asp Leu Val Glu Gly Arg Pro His Pro Leu Val Leu Arg Gly 
2065 2070 2075 2080 

Thr Ala Gly Thr Ser Asp Gin Val Val Phe Val Phe Pro Gly Gin Gly 
2085 2090 2095 

Ser Gin Trp Pro Glu Met Ala Asp Gly Leu Leu Ala Arg Ser Ser Gly 
2100 2105 2110 

Ser Gly Ser Phe Leu Glu Thr Ala Arg Ala Cys Asp Leu Ala Leu Arg 
2115 2120 2125 

Pro His Leu Gly Trp Ser Val Leu Asp Val Leu Arg Arg Glu Pro Gly 
2130 2135 2140 

Ala Pro Ser Leu Asp Arg Val Asp Val Val Gin Pro Val Leu Phe Thr 
2145 2150 2155 2160 

Met Met Val Ser Leu Ala Glu Thr Trp Arg Ser Leu Gly Val Glu Pro 
2165 2170 2175 

Ala Ala Val Val Gly His Ser Gin Gly Glu He Ala Ala Ala Tyr Val 
2180 2185 2190 

Ala Gly Ala Leu Thr Leu Asp Asp Ala Ala Arg He Val Ala Leu Arg 
2195 2200 2205 

Ser Gin Ala Trp Leu Arg Leu Ala Gly Lys Gly Gly Met Val Ala Val 

2210 2215 2220 

Thr Leu Ser Glu Arg Asp Leu Arg Pro Arg Leu Glu Pro Trp Ser Asp 

2225 2230 2235 2240 

Arg Leu Ala Val Ala Ala Val Asn Gly Pro Glu Thr Cys Ala Val Ser 
2245 2250 ' 2255 

Gly Asp Pro Asp Ala Leu Ala Glu Leu Val Ala Glu Leu Gly Ala Glu 
2260 2265 2270 

Gly Val His Ala Arg Pro He Pro Gly Val Asp Thr Ala Gly His Ser 
2275 2280 2285 

Pro Gin Val Asp Thr Leu Glu Ala His Leu Arg Lys Val Leu Ala Pro 
2290 2295 2300 

Val Ala Pro Arg Thr Ser Asp He Pro Phe Tyr Ser Thr Val Thr Gly 
2305 2310 2315 2320 

Gly Leu He Asp Thr Ala Glu Leu Asp Ala Asp Tyr Trp Tyr Arg Asn 
2325 2330 2335 

Met Arg Glu Pro Val Glu Phe Glu Gin Ala Thr Arg Ala Leu He Ala 
2340 2345 2350 

Asp Gly His Asp Val Phe Leu Glu Ser Ser Pro His Pro Met Leu Ala 
2355 2360 2365 
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Val Ser Leu Gin Glu Thr He Ser Asp Ala Gly Ser Pro Ala Ala Val 
2370 2375 2380 

Leu Gly Thr Leu Arg Arg Gly Gin Gly Gly Pro Arg Trp Leu Gly Val 
2385 2390 2395 2400 

Ala Leu Cys Arg Ala Tyr Thr His Gly Leu Glu He Asp Ala Glu Ala 
2405 2410 2415 

He Phe Gly Pro Asp Ser Arg Gin Val Glu Leu Pro Thr Tyr Pro Phe 
2420 2425 2430 

Gin Arg Glu Arg Tyr Trp Tyr Ser Pro Gly His Arg Gly Asp Asp Pro 
2435 2440 2445 

Ala Ser Leu Gly Leu Asp Ala Val Asp His Pro Leu Leu Gly Ser Gly 
2450 2455 2460 

Val Glu Leu Pro Glu Ser Gly Asp Arg Met Tyr Thr Ala Arg Leu Gly 
2465 2470 2475 2480 

Ala Asp Thr Thr Pro Trp Leu Ala Asp His Ala Leu Leu Gly Ser Pro 
2485 2490 2495 

Leu Leu Pro Gly Ala Ala Phe Ala Asp Leu Ala Leu Trp Ala Gly Arg 
2500 2505 2510 

Gin Ala Gly Thr Gly Arg Val Glu Glu Leu Thr Leu Ala Ala Pro Leu 
2515 2520 2525 

Val Leu Pro Gly Ser Gly Gly Val Arg Leu Arg Leu Asn Val Gly Ala 
2530 2535 2540 

Pro Gly Thr Asp Asp Ala Arg Arg Phe Ala Val His Ala Arg Ala Glu 
2545 2550 2555 2560 

Gly Ala Thr Asp Trp Thr Leu His Ala Glu Gly Leu Leu Thr Ala Gin 
35 2565 2570 2575 

Asp Thr Ala Asp Ala Pro Asp Ala Ser Ala Ala Thr Pro Pro Pro Gly 

2580 2585 2590 

Ala Glu Gin Leu Asp He Gly Asp Phe Tyr Gin Arg Phe Ser Glu Leu 
40 2595 2600 2605 

Gly Tyr Gly Tyr Gly Pro Phe Phe Arg Gly Leu Val Ser Ala His Arg 
2610 2615 2620 



10 



15 



20 



25 



30 



45 



Cys Gly Pro Asp He His Ala Glu Val Ala Leu Pro Val Gin Ala Gin 

2625 2630 2635 2640 

Gly Asp Ala Ala Arg Phe Gly He His Pro Ala Leu Leu Asp Ala Ala 
2645 2650 2655 

SO Leu Gin Thr Met Ser Leu Gly Gly Phe Phe Pro Glu Asp Gly Arg Val 

2660 2665 2670 

Arg Met Pro Phe Ala Leu Arg Gly Val Arg Leu Tyr Arg Ala Gly Ala 
2675 2680 2685 



55 



Asp Arg Leu His Val Arg Val Ser Pro Val Ser Glu Asp Ala Val Arg 
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2690 2695 2700 

lie Arg Cys Ala Asp Gly Glu Gly Arg Pro Val Ala Glu lie Glu Ser 
2705 2710 2715 2720 

5 

Phe lie Met Arg Pro Val Asp Pro Gly Gin Leu Leu Gly Gly Arg Pro 
2725 2730 2735 

Val Gly Ala Asp Ala Leu Phe Arg lie Ala Trp Arg Glu Leu Ala Ala 
10 2740 2745 2750 

Gly Pro Gly Thr Arg Thr Gly Asp Gly Thr Pro Pro Pro Val Arg Trp 
2755 2760 2765 

Val Leu Ala Gly Pro Asp Ala Leu Gly Leu Ala Glu Ala Ala Asp Ala 
15 2770 2775 2780 

His Leu Pro Ala Val Pro Gly Pro Asp Gly Ala Leu Pro Ser Pro Thr 
2785 2790 2795 2800 

20 Gly Arg Pro Ala Pro Asp Ala Val Val Phe Ala Val Arg Ala Gly Thr 

2805 2810 2815 

Gly Asp Val Ala Ala Asp Ala His Thr Val Ala Cys Arg Val Leu Asp 
2820 2825 2830 

25 Leu Val Gin Arg Arg Leu Ala Ala Pro Glu Gly Pro Asp Gly Ala Arg 

2835 2840 2845 

Leu Val Val Ala Thr Arg Gly Ala Val Ala Val Arg Asp Asp Ala Glu 
2850 2855 2860 

Val Asp Asp Pro Ala Ala Ala Ala Ala Trp Gly Leu Leu Arg Ser Ala 
2865 2870 2875 2880 

Gin Ala Glu Glu Pro Gly Arg Phe Leu Leu Val Asp Leu Asp Asp Asp 
2885 2890 * 2895 

Pro Ala Ser Ala Arg Ala Leu Thr Asp Ala Leu Ala Ser Gly Glu Pro 
2900 2905 2910 

Gin Thr Ala Val Arg Ala Gly Thr Val Tyr Val Pro Arg Leu Glu Arg 
40 2915 2920 2925 

Ala Ala Asp Arg Thr Asp Gly Pro Leu Thr Pro Pro Asp Asp Gly Ala 
2930 2935 2940 



30 



35 



45 



Trp Arg Leu Gly Arg Gly Thr Asp Leu Thr Leu Asp Gly Leu Ala Leu 
2945 2950 2955 2960 

Val Pro Ala Pro Asp Ala Glu Ala Pro Leu Glu Pro Gly Gin Val Arg 
2965 2970 2975 

Val Ala Val Arg Ala Ala Gly Val Asn Phe Arg Asp Ala Leu He Ala 
so 2980 2985 2990 

Leu Gly Met Tyr Pro Gly Glu Ala Glu Met Gly Thr Glu Gly Ala Gly 
2995 3000 3005 

55 Thr Val Val Glu Val Gly Pro Gly Val Thr Gly Val Ala Val Gly Asp 

3010 3015 3020 
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w 



is 



Arg Val Leu Gly Leu Trp Asp Gly Gly Leu Gly Pro Leu Cys Val Ala 
3025 3030 3035 * 3040 

Asp His Arg Leu Leu Ala Pro Val Pro Asp Gly Trp Ser Tyr Ala Gin 
3045 3050 3055 

Ala Ala Ser Val Pro Ala Val Phe Leu Ser Ala Tyr Tyr Gly Leu Val 
3060 3065 3070 

Thr Leu Ala Gly Leu Arg Pro Gly Glu Arg Val Leu Val His Ala Ala 
3075 3080 3085 

Ala Gly Gly Val Gly Met Ala Ala Val Gin He Ala Arg His Leu Gly 
3090 3095 310 0 

Ala Glu Val Leu Ala Thr Ala Ser Pro Gly Lys Trp Asp Ala Leu Arg 
3105 3110 3115 3120 

Ala Met Gly He Thr Asp Asp His Leu Ala Ser Ser Arg Thr Leu Asp 
20 3125 3X30 " 3135 

Phe Ala Thr Ala Phe Thr Gly Ala Asp Gly Thr Ser Arg Ala Asp Val 
3140 3145 3150 

2S Val Leu Asn Ser Leu Th r Lys Glu Phe Val Asp Ala Ser Leu Gly Leu 

3155 3160 3165 

LeU tf? n Pr ° Gly Gly Ar3 Phe L6U Glu Leu G1 V L y* Thr Asp Val Arg 
3170 3175 3180 

Asp Pro Glu Arg lie Ala Ala Glu His Pro Gly Val Arg Tyr Arg Ala 
3185 3190 3195 3200 

Phe Asp Leu Asn Glu Ala Gly Pro Asp Ala Leu Gly Arg Leu Leu Arg 
3205 3210 3215 

Glu Leu Met Asp Leu Phe Ala Ala Gly Val Leu His Pro Leu Pro Val 
3220 3225 32 30 

Val Thr His Asp Val Arg Arg Ala Ala Asp Ala Leu Arg Thr lie Ser 
3235 3240 3245 

° ln ^c^ 9 Hi3 Thr Gly Lys Leu Val Leu ^ Met Pro Pro Ala Trp 
3250 3255 326O 

His Pro Tyr Gly Thr Val Leu Val Thr Gly Gly Thr Gly Ala Leu Gly 
4S 3265 3270 3275 * 32 80 

Ser Arg lie Ala Arg His Leu Ala Ser Arg His Gly Val Arg Arg Leu 
3285 3290 32 9S 

Leu He Ala Ala Arg Arg Gly Pro Asp Gly Glu Gly Ala Ala Glu Leu 
so 3300 3305 3310 

Val Ala Asp Leu Ala Ala Leu Gly Ala Ser Ala Thr Val Val Ala Cys 
3315 3320 3325 

^ P ™ Ser ASP Ala ASP Ala Val Ar 9 Gly Leu Le » Ala Gly lie Pro 
55 3330 3335 3340 



30 



35 



40 
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Ala Asp His Pro Leu Thr Ala Val Val His Ser Thr Gly Val Leu Asp 
3345 3350 3355 3360 

Asp Gly Val Leu Pro Gly Leu Thr Pro Glu Arg Met Arg Arg Val Leu 
3365 3370 3375 

Arg Pro Lys Val Glu Ala Ala Val His Leu Asp Glu Leu Thr Arg Asp 
3380 3385 3390 

Leu Asp Leu Ser Ala Phe Val Leu Phe Ser Ser Ser Ala Gly Leu Leu 
3395 3400 3405 

Gly Ser Pro Ala Gin Gly Asn Tyr Ala Ala Ala Asn Ala Thr Leu Asp 
3410 3415 3420 

Ala Leu Ala Ala Arg Arg Arg Ser Leu Gly Leu Pro Ser Val Ser Leu 
3425 3430 3435 3440 

Ala Trp Gly Leu Trp Ser Asp Thr Ser Arg Met Ala His Ala Leu Asp 
3445 3450 3455 

Gin Glu Ser Leu Gin Arg Arg Phe Ala Arg Ser Gly Phe Pro Pro Leu 
3460 3465 3470 

Ser Ala Thr Leu Gly Ala Ala Leu Phe Asp Ala Ala Leu Arg Val Asp 
3475 3480 3485 

Glu Ala Val Gin Val Pro Met Arg Phe Asp Pro Ala Ala Leu Arg Ala 
3490 3495 3500 

Thr Gly Ser Val Pro Ala Leu Leu Ser Asp Leu Val Gly Ser Ala Pro 
30 3505 3510 3515 3520 

Ala Thr Gly Ser Ala Ala Pro Ala Ser Gly Pro Leu Pro Ala Pro Asp 
3525 3530 3535 

Ala Gly Thr Val Gly Glu Pro Leu Ala Glu Arg Leu Ala Gly Leu Ser 
35 3540 3545 3550 

Ala Glu Glu Arg His Asp Arg Leu Leu Gly Leu Val Gly Glu His Val 
3555 3560 3565 

40 Ala Ala Val Leu Gly His Gly Ser Ala Ala Glu Val Arg Pro Asp Arg 

3570 , 3575 3580 

Pro Phe Arg Glu Val Gly Phe Asp Ser Leu Thr Ala Val Glu Leu Arg 
3585 3590 3595 3600 



10 



15 



20 



25 



45 



Asn Arg Met Ala Ala Val Thr Gly Val Arg Leu Pro Ala Thr Leu Val 
3605 3610 3615 



Phe Asp His Pro Thr Pro Ala Ala Leu Ser Ser His Leu Asp Gly Leu 
3620 3625 3630 

SO Leu Ala Pro Ala Gin Pro Val Thr Thr Thr Pro Leu Leu Ser Glu Leu 

3635 3640 3645 

Asp Arg He Glu Glu Ala Leu Ala Ala Leu Thr Pro Glu His Leu Ala 
3650 3655 3660 



55 



Glu Leu Ala Pro Ala Pro Asp Asp Arg Ala Glu Val Ala Leu Arg Leu 
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3665 3670 3675 3680 

Asp Ala Leu Ala Asp Arg Trp Arg Ala Leu His Asp Gly Ala Pro Gly 
3685 3690 3695 

Ala Asp Asp Asp He Thr Asp Val Leu Ser Ser Ala Asp Asp Asp Glu 
3700 3705 3710 

He Phe Ala Phe He Asp Glu Arg Tyr Gly Thr Ser 
3715 3720 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1580 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Met Ala Asn Glu Glu Lys Leu Arg Ala Tyr Leu Lys Arg Val Thr Gly 
1 5 10 ~ ~ 15 

Glu Leu His Arg Ala Thr Glu Gin Leu Arg Ala Leu Asp Arg Arg Ala 
20 25 30 

His Glu Pro He Ala He Val Gly Ala Ala Cys Arg Leu Pro Gly Gly 
35 40 45 

Val Glu Ser Pro Asp Asp Leu Trp Glu Leu Leu His Ala Gly Ala Asp 
50 55 60 

Ala Val Gly Pro Ala Pro Ala Asp Arg Gly Trp Asp Val Glu Gly Arg 
65 70 75 80 

Tyr Ser Pro Asp Pro Asp Thr Pro Gly Thr Ser Tyr Cys Arg Glu Gly 
85 90 95 

Gly Phe Val Gin Gly Ala Asp Arg Phe Asp Pro Ala Leu Phe Gly He 
100 105 110 

Ser Pro Asn Glu Ala Leu Thr Met Asp Pro Gin Gin Arg Leu Leu Leu 
115 120 125 

Glu Thr Ser Trp Glu Ala Leu Glu Arg Ala Gly Leu Asp Pro Gin Ser 
130 135 140 

Leu Ala Gly Ser Arg Thr Gly Val Phe Ala Gly Ala Trp Glu Ser Gly 
145 150 155 160 

Tyr Gin Lys Gly Val Glu Gly Leu Glu Ala Asp Leu Glu Ala Gin Leu 

165 170 175 

Leu Ala Gly He Val Ser Phe Thr Ala Gly Arg Val Ala Tyr Ala Leu 

180 185 190 

Gly Leu Glu Gly Pro Ala Leu Thr He Asp Thr Ala Cys Ser Ser Ser 
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195 



200 



205 



10 



Leu Val Ala Leu His Leu Ala Val Gin Ser Leu Arg Arg Gly Glu Cys 
210 215 220 

Asp Leu Ala Leu Ala Gly Gly Ala Thr Val He Ala Asp Phe Ala Leu 
225 230 235 240 

Phe Thr Gin Phe Ser Arg Gin Arg Gly Leu Ala Pro Asp Gly Arg Cys 
245 250 " 255 

Lys Ala Phe Gly Glu Thr Ala Asp Gly Phe Gly Pro Ala Glu Gly Ala 
260 265 270 



15 



Gly Met Leu Leu Val Glu Arg Leu Ser Asp Ala Arg Arg Asn Gly His 
275 280 285 



Pro Val Leu Ala Val Val Arg Gly Ser Ala Val Asn Gin Asp Gly Ala 
290 295 300 



20 



Ser Asn Gly Leu Thr Ala Pro Ser Gly Pro Ala Gin Gin Arg Val lie 
305 310 315 " 320 



Arg Glu Ala Leu Ala Asp Ala Gly Leu Thr Pro Ala Asp Val Asp Ala 
325 330 * 335 



25 



Val Glu Ala His Gly Thr Gly Thr Pro Leu Gly Asp Pro He Glu Ala 
340 345 350 



30 



Gly Ala Leu Met Ala Thr Tyr Gly His Glu Arg Thr Gly Asp Pro Leu 
355 360 365 

Trp Leu Gly Ser Leu Lys Ser Asn He Gly His Thr Gin Ala Ala Ala 
370 375 380 



35 



Gly Val Ala Gly Val He Lys Met Val Leu Ala Leu Arg His Gly Glu 
385 390 395 400 

Leu Pro Arg Thr Leu His Ala Ser Thr Ala Ser Ser Arg He Glu Trp 
405 410 415 



40 



Asp Ala Gly Ala Val Glu Leu Leu Asp Glu Ala Arg Pro Trp Pro Arg 
420 425 430 



Arg Ala Glu Gly Pro Arg Arg Ala Gly He Ser Ser Phe Gly He Ser 
435 440 445 



45 



Gly Thr Asn Ala His Leu Val He Glu Glu Glu Pro Pro Ala Arg Pro 
450 455 460 



Glu Pro Glu Glu Ala Ala Gin Pro Pro Ala Pro Ala Thr Thr Val Leu 
465 470 475 480 



50 



Pro Leu Ser Ala Ala Gly Ala Arg Ser Leu Arg Glu Gin Ala Arg Arg 
485 490 495 



55 



Leu Ala Ala His Leu Ala Gly His Glu Glu lie Thr Ala Ala Asp Ala 
500 505 510 

Ala Arg Ser Ala Ala Thr Thr Arg Ala Ala Leu Ser His Arg Ala Ser 
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515 520 525 

Val Leu Ala Asp Asp Arg Arg Ala Leu lie Asp Arg Leu Thr Ala Leu 
530 535 540 

Ala Glu Asp Arg Lys Asp Pro Gly Val Thr Val Gly Glu Ala Gly Ser 
545 550 555 560 



10 



Gly Arg Pro Pro Val Phe Val Phe Pro Gly Gin Gly Ser Gin Trp Thr 
565 570 575 

Gly Met Gly Ala Glu Leu Leu Asp Arg Ala Pro Val Phe Arg Ala Lys 
580 585 590 



75 



Ala Glu Glu Cys Ala Arg Ala Leu Ala Ala His Leu Asp Trp Ser Val 
595 600 605 

Leu Asp Val Leu Arg Asp Ala Pro Gly Ala Pro Pro lie Asp Arg Ala 
610 615 620 



20 



Asp Val Val Gin Pro Thr Leu Phe Thr Met Met Val Ser Leu Ala Ala 
625 630 635 640 



Leu Trp Glu Ser His Gly Val Arg Pro Ala Ala Val Val Gly His Ser 
645 650 655 



25 



Gin Gly Glu He Ala Ala Ala His Ala Ala Gly Ala Leu Ser Leu Asp 
660 665 670 



Asp Ala Ala Arg Val He Ala Glu Arg Ser Arg Leu Trp Lys Arg Leu 
675 680 685 



30 



Ala Gly Asn Gly Gly Met Leu Ser Val Met Ala Pro Ala Asp Arg Val 
690 695 700 



Arg Glu Leu Met Glu Pro Trp Ala Glu Arg Met Ser Val Ala Ala Val 
705 710 715 720 



35 



Asn Gly Pro Ala Ser Val Thr Val Ala Gly Asp Ala Arg Ala Leu Glu 
725 730 735 



40 



45 



Glu Phe Gly Gly Arg Leu Ser Ala Ala Gly Val Leu Arg Trp Pro Leu 
740 745 750 

Ala Gly Val Asp Phe Ala Gly His Ser Pro Gin Val Glu Gin Phe Arg 
755 760 765 

Ala Glu Leu Leu Asp Thr Leu Gly Thr Val Arg Pro Thr Ala Ala Arg 
770 775 780 

Leu Pro Phe Phe Ser Thr Val Thr Ala Ala Ala His Glu Pro Glu Gly 
785 790 795 800 



50 



Leu Asp Ala Ala Tyr Trp Tyr Arg Asn Met Arg Glu Pro Val Glu Phe 
805 810 815 



Ala Ser Thr Leu Arg Thr Leu Leu Arg Glu Gly His Arg Thr Phe Val 
820 825 830 



55 



Glu Met Gly Pro His Pro Leu Leu Gly Ala Ala He Asp Glu Val Ala 
835 840 845 
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Glu Ala Glu Gly Val His Ala Thr Ala Leu Ala Thr Leu His Arg Gly 
850 855 860 

Ser Gly Gly Leu Asp Arg Phe Arg Ser Ser Val Gly Ala Ala Phe Ala 
865 870 875 880 

His Gly Val Arg Val Asp Trp Asp Ala Leu Phe Glu Cly Ser Gly Ala 
885 890 895 

Arg Arg Val Pro Leu Pro Thr Tyr Ala Phe Ser Arg Asp Arg Tyr Trp 
900 905 910 

Leu Pro Thr Ala lie Gly Arg Arg Ala Val Glu Ala Ala Pro Val Asp 
915 920 925 

Ala Ser Ala Pro Gly Arg Tyr Arg Val Thr Trp Thr Pro Val Ala Ser 
930 935 940 

Asp Asp Ser Gly Arg Pro Ser Gly Arg Trp Leu Leu Val Gin Thr Pro 
945 950 955 960 

Gly Thr Ala Pro Asp Glu Ala Asp Thr Ala Ala Ser Ala Leu Gly Ala 
965 970 975 

Ala Gly Val Val Val Glu Arg Cys Leu Leu Asp Pro Thr Glu Ala Ala 
980 985 990 

Arg Val Thr Leu Thr Glu Arg Leu Ala Glu Leu Asp Ala Gin Pro Glu 
995 1000 1005 

Gly Leu Ala Gly Val Leu Val Leu Pro Gly Arg Pro Gin Ser Thr Ala 
1010 1015 1020 

Pro Ala Asp Ala Ser Pro Leu Asp Pro Gly Thr Ala Ala Val Leu Leu 
1025 1030 1035 1040 

Val Val Gin Ala Val Pro Asp Ala Ala Pro Lys Ala Arg lie Trp Val 
1045 1050 * 1055 

Val Thr Arg Gly Ala Val Ala Val Gly Ser Gly Glu Val Pro Cys Ala 
1060 1065 1070 

Val Gly Ala Arg Val Trp Gly Leu Gly Arg Val Ala Ala Leu Glu Val 
1075 1080 1085 

Pro Val Gin Trp Gly Gly Leu Val Asp Val Ala Val Gly Ala Gly Val 
1090 1095 1100 

Arg Glu Trp Arg Arg Val Val Gly Val Val Ala Gly Gly Gly Glu Asp 
1105 1110 1115 1120 

Gin Val Ala Val Arg Gly Gly Gly Val Phe Gly Arg Arg Leu Val Gly 
1125 1130 1135 

Val Gly Val Arg Gly Gly Ser Gly Val Trp Arg Ala Arg Gly Cys Val 
1140 1145 1150 

Val Val Thr Gly Gly Leu Gly Gly Val Gly Gly His Val Ala Arg Trp 
1155 1160 1165 



170 



EP 0 791 655 A2 



Leu Ala Arg Ser Gly Ala Glu His Val Val Leu Ala Gly Arg Arg Gly 
1170 1175 1180 

Gly Gly Val Val Gly Ala Val Glu Leu Glu Arg Glu Leu Val Gly Leu 
5 1185 1190 1195 1200 

Gly Ala Lys Val Thr Phe Val Ser Cys Asp Val Gly Asp Arg Ala Ser 
1205 1210 1215 

io Met Val Gly Leu Leu Gly Val Val Glu Gly Leu Gly Val Pro Leu Arg 

1220 1225 " 1230 

Gly Val Phe His Ala Ala Gly Val Ala Gin Val Ser Gly Leu Gly Glu 
1235 1240 1245 



75 



Val Ser Leu Ala Glu Ala Gly Gly Val Leu Gly Gly Lys Ala Val Gly 
1250 1255 1260 

Ala Glu Leu Leu Asp Glu Leu Thr Ala Gly Val Glu Leu Asp Ala Phe 
1255 1270 1275 1280 

20 Val Leu Phe Ser Ser Gly Ala Gly Val Trp Gly Ser Gly Gly Gin Ser 

1285 1290 1295 

Val Tyr Ala Ala Ala Asn Ala His Leu Asp Ala Leu Ala Glu Arg Arg 
1300 1305 1310 

25 Arg Ala Gin Gly Arg Pro Ala Thr Ser Val Ala Trp Gly Leu Trp Gly 

1315 1320 1325 

Gly Glu Gly Met Gly Ala Asp Glu Gly Val Thr Glu Phe Tyr Ala Glu 
1330 1335 1340 

Arg Gly Leu Ala Pro Met Arg Pro Glu Ser Gly He Glu Ala Leu His 
1345 1350 1355 1360 

Thr Ala Leu Asn Glu Gly Asp Thr Cys Val Thr Val Ala Asp He Asp 
1365 1370 1375 

Trp Glu His Phe Val Thr Gly Phe Thr Ala Tyr Arg Pro Ser Pro Leu 
1380 1385 1390 

He Ser Asp He Pro Gin Val Arg Ala Leu Arg Thr Pro Glu Pro Thr 
40 1395 1400 1405 

Val Asp Ala Ser Asp Gly Leu Arg Arg Arg Val Asp Ala Ala Leu Thr 
1410 1415 1420 



30 



35 



45 



Pro Arg Glu Arg Thr Lys Val Leu Val Asp Leu Val Arg Thr Val Ala 
1425 1430 1435 ~ 1440 

Ala Glu Val Leu Gly His Asp Gly He Gly Gly He Gly His Asp Val 
1445 1450 1455 

50 Ala Phe Arg Asp Leu Gly Phe Asp Ser Leu Ala Ala Val Arg Met Arg 

1460 1465 1470 

Gly Arg Leu Ala Glu Ala Thr Gly Leu Val Leu Pro Ala Thr Val He 

1475 1480 1485 



55 



Phe Asp His Pro Thr Val Asp Arg Leu Gly Gly Ala Leu Leu Glu Arg 
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1490 1495 1500 

Leu Ser Ala Asp Glu Pro Ala Pro Gly Gly Ala Pro Glu Pro Ala Gly 
1505 1510 1515 1520 

Gly Arg Pro Ala Thr Pro Pro Pro Ala Pro Glu Pro Ala Val His Asp 
1525 1530 1535 

Ala Asp lie Asp Glu Leu Asp Ala Asp Ala Leu lie Arg Leu Ala Thr 
1540 1545 1550 

Gly Thr Ala Gly Pro Ala Asp Gly Thr Pro Ala Asp Gly Gly Pro Asp 
1555 1560 1565 

Ala Ala Ala Thr Ala Pro Asp Gly Ala Pro Glu Gin 
1570 1575 1580 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1891 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Ser Pro Ser Met Asp Glu Val Leu Gly Ala Leu Arg Thr Ser Val 
15 10 15 

Lys Glu Thr Glu Arg Leu Arg Arg His Asn Arg Glu Leu Leu Ala Gly 
20 25 30 

Ala His Glu Pro Val Ala He Val Gly Met Ala Cys Arg Tyr Pro Gly 
35 40 45 

Gly Val Ser Thr Pro Asp Asp Leu Trp Glu Leu Ala Ala Asp Gly Val 
50 55 60 

Asp Ala He Thr Pro Phe Pro Ala Asp Arg Gly Trp Asp Glu Asp Ala 
65 70 75 80 

Val Tyr Ser Pro Asp Pro Asp Thr Pro Gly Thr Thr Tyr Cys Arg Glu 
85 90 95 

Gly Gly Phe Leu Thr Gly Ala Gly Asp Phe Asp Ala Ala Phe Phe Gly 
100 105 no 

He Ser Pro Asn Glu Ala Leu Val Met Asp Pro Gin Gin Arg Leu Leu 
115 120 125 

Leu Glu Thr Ser Trp Glu Thr Leu Glu Arg Ala Gly He Val Pro Ala 
130 135 140 

Ser Leu Arg Gly Ser Arg Thr Gly Val Phe Val Gly Ala Ala His Thr 
145 150 155 160 
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Gly Tyr Val Thr Asp Thr Ala Arg Ala Pro Glu Gly Thr Glu Gly Tyr 
165 170 175 

Leu Leu Thr Gly Asn Ala Asp Ala Val Met Ser Gly Arg He Ala Tyr 
180 185 190 

Ser Leu Gly Leu Glu Gly Pro Ala Leu Thr He Gly Thr Ala Cys Ser 
195 200 205 

Ser Ser Leu Val Ala Leu His Leu Ala Val Gin Ser Leu Arg Arg Gly 
210 215 220 

Glu Cys Asp Leu Ala Leu Ala Gly Gly Val Ala Val Met Pro Asp Pro 
225 230 235 240 

Thr Val Phe Val Glu Phe Ser Arg Gin Arg Gly Leu Ala Val Asp Gly 
245 250 255 

Arg Cys Lys Ala Phe Ala Glu Gly Ala Asp Gly Thr Ala Trp Ala Glu 

260 265 270 

Gly Val Gly Val Leu Leu Val Glu Arg Leu Ser Asp Ala Arg Arg Asn 

275 280 285 

Gly His Arg Val Leu Ala Val Val Arg Gly Ser Ala Val Asn Gin Asp 
290 295 300 

Gly Ala Ser Asn Gly Leu Thr Ala Pro Ser Gly Pro Ala Gin Gin Arg 
305 310 315 320 

Val He Arg Glu Ala Leu Ala Asp Ala Gly Leu Thr Pro Ala Asp Val 
325 330 335 

Asp Val Val Glu Ala His Gly Thr Gly Thr Ala Leu Gly Asp Pro He 
340 345 350 

Glu Ala Gly Ala Leu Leu Ala Thr Tyr Gly Arg Glu Arg Val Gly Asp 
355 360 365 

Pro Leu Trp Leu Gly Ser Leu Lys Ser Asn He Gly His Ala Gin Ala 
370 375 380 

Ala Ala Gly Val Gly Gly Val He Lys Val Val Gin Ala Met Arg His 
385 390 395 400 

Gly Ser Leu Pro Arg Thr Leu His Val Asp Ala Pro Ser Ser Lys Val 
405 410 415 

Glu Trp Ala Ser Gly Ala Val Glu Leu Leu Thr Glu Gly Arg Ser Trp 
420 425 430 

Pro Arg Arg Val Glu Arg Val Arg Arg Ala Ala Val Ser Ala Phe Gly 
435 440 445 

Val Ser Gly Thr Asn Ala His Val Val Leu Glu Glu Ala Pro Val Glu 
450 455 460 



Ala Gly Ser Glu His Gly Asp Gly Pro Gly Pro Asp Arg Pro Asp Ala 
465 470 475 480 

Val Thr Gly Pro Leu Pro Trp Val Leu Ser Ala Arg Ser Arg Glu Ala 
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485 



490 



495 



Leu Arg Gly Gin Ala Gly Arg Leu Ala Ala Leu Ala Arg Gin Gly Arg 
500 505 510 

Thr Glu Gly Thr Gly Gly Gly Ser Gly Leu Val Val Pro Ala Ala Asp 
515 520 525 

lie Gly Tyr Ser Leu Ala Thr Thr Arg Glu Thr Leu Glu His Arg Ala 
530 535 540 

Val Ala Leu Val Gin Glu Asn Arg Thr Ala Gly Glu Asp Leu Ala Ala 
545 550 555 560 

Leu Ala Ala Gly Arg Thr Pro Glu Ser Val Val Thr Gly Val Ala Arg 
565 570 575 

Arg Gly Arg Gly lie Ala Phe Leu Cys Ser Gly Gin Gly Ala Gin Arg 
580 585 590 

Leu Gly Ala Gly Arg Glu Leu Arg Gly Arg Phe Pro Val Phe Ala Asp 
595 600 605 

Ala Leu Asp Glu He Ala Ala Glu Phe Asp Ala His Leu Glu Arg Pro 

610 615 620 

Leu Leu Ser Val Met Phe Ala Glu Pro Ala Thr Pro Asp Ala Ala Leu 

625 630 635 640 

Leu Asp Arg Thr Asp Tyr Thr Gin Pro Ala Leu Phe Ala Val Glu Thr 
645 650 655 

Ala Leu Phe Arg Leu Leu Glu Ser Trp Gly Leu Val Pro Asp Val Leu 
660 665 670 

Val Gly His Ser He Gly Gly Leu Val Ala Ala His Val Ala Gly Val 
675 680 685 

Phe Ser Ala Ala Asp Ala Ala Arg Leu Val Ser Ala Arg Gly Arg Leu 
690 695 700 

Met Arg Ala Leu Pro Glu Gly Gly Ala Met Ala Ala Val Gin Ala Thr 
705 710 715 720 

Glu Arg Glu Ala Ala Ala Leu Glu Pro Val Ala Ala Gly Gly Ala Val 
725 730 735 

Val Ala Ala Val Asn Gly Pro Gin Ala Leu Val Leu Ser Gly Asp Glu 
740 745 750 

Ala Ala Val Leu Ala Ala Ala Gly Glu Leu Ala Ala Arg Gly Arg Arg 
755 760 765 

Thr Lys Arg Leu Arg Val Ser His Ala Phe His Ser Pro Arg Met Asp 
770 775 780 

Ala Met Leu Ala Asp Phe Arg Ala Val Ala Asp Thr Val Asp Tyr His 
785 790 795 800 



Ala Pro Arg Leu Pro Val Val Ser Glu Val Thr Gly Asp Leu Ala Asp 
805 810 815 



174 



EP 0 791 655 A2 



Ala Ala Gin Leu Thr Asp Pro Gly Tyr Trp Thr Arg Gin Val Arg Gin 
820 825 830 

Pro Val Arg Phe Ala Asp Ala Val Arg Thr Ala Ser Ala Arg Asp Ala 
835 840 845 

Ala Thr Phe He Glu Leu Gly Pro Asp Ala Val Leu Cys Gly Met Ala 
850 855 860 

Glu Glu Ser Leu Ala Ala Glu Ala Asp Val Val Phe Ala Pro Ala Leu 
865 870 875 880 

Arg Arg Gly Arg Pro Glu Gly Asp Thr Val Leu Arg Ala Ala Ala Ser 
885 890 895 

Ala Tyr Val Arg Gly Ala Gly Leu Asp Trp Ala Ala Leu Tyr Gly Gly 
900 905 910 

Thr Gly Ala Arg Arg Thr Asp Leu Pro Thr Tyr Ala Phe Gin His Ser 
915 920 925 

Arg Tyr Trp Leu Ala Pro Ala Ser Ala Ala Val Ala Pro Ala Thr Ala 
930 935 940 

Ala Pro Ser Val Arg Ser Val Pro Glu Ala Glu Gin Asp Gly Ala Leu 
945 950 955 960 

Trp Ala Ala Val His Ala Gly Asp Val Ala Ser Ala Ala Ala Arg Leu 

965 970 975 

Gly Ala Asp Asp Ala Gly He Glu His Glu Leu Arg Ala Val Leu Pro 
980 985 990 

His Leu Ala Ala Trp His Asp Arg Asp Arg Ala Thr Ala Arg Thr Ala 
995 1000 1005 

Gly Leu His Tyr Arg Val Thr Trp Gin Ala He Glu Ala Asp Ala Val 
1010 1015 1020 

Arg Phe Ser Pro Ser Asp Arg Trp Leu Met Val Glu His Gly Gin His 
1025 1030 1035 1040 

Thr Glu Cys Ala Asp Ala Ala Glu Arg Ala Leu Arg Ala Ala Gly Ala 
1045 1050 1055 

Glu Val Thr Arg Leu Val Trp Pro Leu Glu Gin His Thr Gly Ser Pro 
1060 1065 1070 

Arg Thr Glu Thr Pro Asp Arg Gly Thr Leu Ala Ala Arg Leu Ala Glu 
1075 1080 1085 

Leu Ala Arg Ser Pro Glu Gly Leu Ala Gly Val Leu Leu Leu Pro Asp 
1090 1095 1100 

Ser Gly Gly Ala Ala Val Ala Gly His Pro Gly Leu Asp Gin Gly Thr 
1105 1110 1115 1120 

Ala Ala Val Leu Leu Thr He Gin Ala Leu Thr Asp Ala Ala Val Arg 
1125 1130 1135 
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Ala Pro Leu Trp Val Val Thr Arg Gly Ala Val Ala Val Gly Ser Gly 
1140 1145 1150 

Glu Val Pro Cys Ala Val Gly Ala Arg Val Trp Gly Leu Gly Arg Val 
5 1155 1160 1165 

Ala Ala Leu Glu Val Pro Val Gin Trp Gly Gly Leu Val Asp Val Ala 
1170 1175 1180 

10 v *l Gly Ala Gly Val Arg Glu Trp Arg Arg Val Val Gly Val Val Ala 

1185 1190 1195 1200 

Gly Gly Gly Glu Asp Gin Val Ala Val Arg Gly Gly Gly Val Phe Gly 
1205 1210 1215 

75 Arg Arg Leu Val Gly Val Gly Val Arg Gly Gly Ser Gly Val Trp Arg 

1220 1225 1230 

Ala Arg Gly Cys Val Val Val Thr Gly Gly Leu Gly Gly Val Gly Gly 
1235 1240 " 1245 

His Val Ala Arg Trp Leu Ala Arg Ser Gly Ala Glu His Val Val Leu 
1250 1255 1260 

Ala Gly Arg Arg Gly Gly Gly Val Val Gly Ala Val Glu Leu Glu Arg 
1265 1270 1275 1280 

Glu Leu Val Gly Leu Gly Ala Lys Val Thr Phe Val Ser Cys Asp Val 
1285 1290 1295 

Gly Asp Arg Ala Ser Val Val Gly Leu Leu Gly Val Val Glu Gly Leu 
30 1300 1305 1310 

Gly Val Pro Leu Arg Gly Val Phe His Ala Ala Gly Val Ala Gin Val 

1315 1320 1325 

Ser Gly Leu Gly Glu Val Ser Leu Ala Glu Ala Gly Gly Val Leu Gly 
35 1330 1335 1340 

Gly Lys Ala Val Gly Ala Glu Leu Leu Asp Glu Leu Thr Ala Gly Val 
1345 1350 1355 1360 

Glu Leu Asp Ala Phe Val Leu Phe Ser Ser Gly Ala Gly Val Trp Gly 
40 1365 1370 " 1375 

Ser Gly Gly Gin Ser Val Tyr Ala Ala Ala Asn Ala His Leu Asp Ala 
1380 1385 1390 
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55 



Leu Ala Glu Arg Arg Arg Ala Gin Gly Arg Pro Ala Thr Ser Val Ala 
1395 1400 1405 

Trp Gly Pro Trp Asp Gly Asp Gly Met Gly Glu Met Ala Pro Glu Gly 
1410 1415 1420 

Tyr Phe Ala Arg His Gly Val Ala Pro Leu His Pro Glu Thr Ala Leu 
1425 1430 1435 1440 

Thr Ala Leu His Gin Ala He Asp Gly Gly Glu Ala Thr Val Thr Val 
1445 1450 1455 

Ala Asp He Asp Trp Glu Arg Phe Ala Pro Gly Phe Thr Ala Phe Arg 
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1460 1465 1470 

Pro Ser Pro Leu lie Ala Gly He Pro Ala Ala Arg Thr Ala Pro Ala 
1475 1480 1485 

Ala Gly Arg Pro Ala Glu Asp Thr Pro Thr Ala Pro Gly Leu Leu Arg 
1490 1495 1500 

Ala Arg Pro Glu Asp Arg Pro Arg Leu Ala Leu Asp Leu Val Leu Arg 
1505 1510 1515 1520 

His Val Ala Ala Val Leu Gly His Ser Glu Asp Ala Arg Val Asp Ala 
1525 1530 1535 

Arg Ala Pro Phe Arg Asp Leu Gly Phe Asp Ser Leu Ala Ala Val Arg 
75 1540 1545 1550 

Leu Arg Arg Arg Leu Ala Glu Asp Thr Gly Leu Asp Leu Pro Gly Thr 
1555 1560 1565 



10 



20 



Leu Val Phe Asp His Glu Asp Pro Thr Ala Leu Ala His His Leu Ala 
1570 1575 1580 

Gly Leu Ala Asp Ala Gly Thr Pro Gly Pro Gin Glu Gly Thr Ala Arg 
1585 1590 1595 1600 

25 Ala Glu Ser Gly Leu Phe Ala Ser Phe Arg Ala Ala Val Glu Gin Arg 

1605 1610 1615 

Arg Ser Ser Glu Val Val Glu Leu Met Ala Asp Leu Ala Ala Phe Arg 
1620 1625 1630 

30 

Pro Ala Tyr Ser Arg Gin His Pro Gly Ser Gly Arg Pro Ala Pro Val 
1635 1640 1645 

Pro Leu Ala Thr Gly Pro Ala Thr Arg Pro Thr Leu Tyr Cys Cys Ala 
1650 1655 1660 

Gly Thr Ala Val Gly Ser Gly Pro Ala Glu Tyr Val Pro Phe Ala Glu 
1665 1670 1675 1680 

Gly Leu Arg Gly Val Arg Glu Thr Val Ala Leu Pro Leu Ser Gly Phe 
1685 1690 1695 

Gly Asp Pro Ala Glu Pro Met Pro Ala Ser Leu Asp Ala Leu He Glu 
1700 1705 1710 

Val Gin Ala Asp Val Leu Leu Glu His Thr Ala Gly Lys Pro Phe Ala 
1715 1720 1725 

Leu Ala Gly His Ser Ala Gly Ala Asn He Ala His Ala Leu Ala Ala 
1730 1735 1740 

Arg Leu Glu Glu Arg Gly Ser Gly Pro Ala Ala Val Val Leu Met Asp 
so 1745 1750 1755 1760 

Val Tyr Arg Pro Glu Asp Pro Gly Ala Met Gly Glu Trp Arg Asp Asp 
1765 1770 1775 



35 
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Leu Leu Ser Trp Ala Leu Glu Arg Ser Thr Val Pro Leu Glu Asp His 
1780 1785 1790 
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Arg Leu Thr Ala Met Ala Gly Tyr Gin Arg Leu Val Leu Gly Thr Arg 
1795 1800 1805 

Leu Thr Ala Leu Glu Ala Pro Val Leu Leu Ala Arg Ala Ser Glu Pro 
1810 1815 1820 

Leu Cys Ala Trp Pro Pro Ala Gly Gly Ala Arg Gly Asp Trp Arg Ser 
1825 1830 1835 1840 

Gin Val Pro Phe Ala Arg Thr Val Ala Asp Val Pro Gly Asn His Phe 
1845 1850 1855 

Thr Met Leu Thr Glu His Ala Arg His Thr Ala Ser Leu Val His Glu 
1860 1865 1870 

Trp Leu Asp Ser Leu Pro His Gin Pro Gly Pro Ala Pro Leu Thr Gly 
1875 1880 1885 

Gly Lys His 
1890 



(2) INFORMATION FOR SEQ ID NO: 13: 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13987 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 350.. 13987 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 

GACCGCTCGG GGAGACCTGA CATATTCGTC GCGAAGTGGT TGTCCGCGCC GCGAGGTACT 60 

40 GAAATCTTCT CCGCTCGCCC AGGACTCCGC GTGCAGGTCA CCGGAGTGCG CGACCGGCCG 120 

GGACGTCGGA GCGCCGACCC TGCGGACCTG GTGCGATGCC GTGTGGTCCC GCATGATCCC 180 

GCGCCGTCTC CGGTGACGAG AATCGGTGGA CAATCTCCGA ACTTGACACA ATTGATTGTC 240 

45 GTTCACCGGC CGTTCCTGTC GCCCGGCAGT TCGCCCGCTG TACGCTCGGG AAGATCAAGA 300 

AAAGGCAGAA AAGCCACGGC GTGGTACGGC GAACATATGA GGGATGCAG GTG TCT 355 

Met Ser 
1 

50 

GGA GAA CTC GCG ATT TCC CGC ACT CAC GAC CGG TCC GAC GCC GTT GCC 403* 
Gly Glu Leu Ala He Ser Arg Ser Asp Asp Arg Ser Asp Ala Val Ala 
5 10 15 

55 GTG GTC GGA ATG GCG TGC CGG TTT CCC GGC GCC CCG GGA ATT GCC GAA 451 

Val Val Gly Met Ala Cys Arg Phe Pro Gly Ala Pro Gly He Ala Glu 



(i) 
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20 25 30 

TTC TGG GAA CTG CTC CGC AGC GGA CGC GGT ATC CCC ACC CGT CAG GAC 499 
Phe Trp Glu Leu Leu Arg Ser Gly Arg Gly Met Pro Thr Arg Gin Asp 
35 40 45 50 

GAC GGC ACC TGG CGG GCC GCC CTG GAG GAC CAC GCC GGC TTC GAC GCC 547 
Asp Gly Thr Trp Arg Ala Ala Leu Glu Asp His Ala Gly Phe Asp Ala 
55 60 . . $5- 

GGG TTC TTC GGC ATG AAC GCC CGG CAG GCC GCC GCC ACC GAC CCG CAG 595 
Gly Phe Phe Gly Met Asn Ala Arg Gin Ala Ala Ala Thr Asp Pro Gin 
70 75 80 

CAC CGA CTG ATG CTG GAA CTC GGA TGG GAG GCA CTG GAG GAC GCG GGC 643 
His Arg Leu Met Leu Glu Leu Gly Trp Glu Ala Leu Glu Asp Ala Gly 
85 90 95 

ATC GTC CCC GGC GAT CTC ACC GGC ACC GAC ACC GGA GTC TTC GCC GGC 691 
lie Val Pro Gly Asp Leu Thr Gly Thr Asp Thr Gly Val Phe Ala Gly 
100 105 110 

GTG GCG TCC GAC GAC TAT GCC GTT CTC ACC CGC CGT TCC GCC GTC TCC 739 
Val Ala Ser Asp Asp Tyr Ala Val Leu Thr Arg Arg Ser Ala Val Ser 
115 120 125 130 

25 GCC GGG GGA TAC ACC GCC ACG GGG CTG CAC CGC GCC CTG GCC GCC AAC 787 

Ala Gly Gly Tyr Thr Ala Thr Gly Leu His Arg Ala Leu Ala Ala Asn 
135 140 145 

CGC CTC TCC CAC TTC CTG GGC CTG CGC GGC CCC AGC CTG GTC GTC GAC 835 
3Q Arg Leu Ser His Phe Leu Gly Leu Arg Gly Pro Ser Leu Val Val Asp 

150 155 160 

TCG GCC CAG TCC GCC TCA CTG GTG GCC GTC CAG CTC GCC TCC GAG AGT 883 
Ser Ala Gin Ser Ala Ser Leu Val Ala Val Gin Leu Ala Cys Glu Ser 
165 170 175 

35 

CTG CGC CGG GGT GAG ACG TCG CTC GCC GTC GCG GGC GGT GTC AAC CTC 931 
Leu Arg Arg Gly Glu Thr Ser Leu Ala Val Ala Gly Gly Val Asn Leu 
180 185 190 

40 ATC CTC ACC GAG GAG AGC ACC ACC GTC ATG GAG CGT ATG GGA GCG CTC 979 

He Leu Thr Glu Glu Ser Thr Thr Val Met Glu Arg Met Gly Ala Leu 
195 200 205 210 

TCA CCC GAC GGC CGC TGC CAC ACC TTC GAC GCC CGC GCC AAC GGC TAC 1027 
Ser Pro Asp Gly Arg Cys His Thr Phe Asp Ala Arg Ala Asn Gly Tyr 
215 220 225 

GTA CGC GGC GAG GGC GGC GGA GCC GTC GTG CTC AAG CCA CTG GAC GCC 1075 
Val Arg Gly Glu Gly Gly Gly Ala Val Val Leu Lys Pro Leu Asp Ala 
230 235 240 

GCA CTC GCC GAC GGC GAC CGC GTG TAC TGC GTC ATC AAG GGA GGT GCC 1123 
Ala Leu Ala Asp Gly Asp Arg Val Tyr Cys Val He Lys Gly Gly Ala 
245 250 255 

55 GTC AAC GAC GGC GGC GGC GCG AGC CTC ACC ACT CCC GAC CGG GAG 1171 

Val Asn Asn Asp Gly Gly Gly Ala Ser Leu Thr Thr Pro Asp Arg Glu 
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260 265 270 

GCG CAG GAA GCT GTG CTG CGC CAG GCC TAC CGG CGG GCG GGC GTC AGC 1219 
Ala Gin Glu Ala Val Leu Arg Gin Ala Tyr Arg Arg Ala Gly Val Ser 
5 275 280 285 290 

ACC GGC GCC GTC CGC TAC GTC GAG CTG CAC GGG ACC GGC ACC CGG GCC 1267 
Thr Gly Ala Val Arg Tyr Val Glu Leu His Gly Thr Gly Thr Arg Ala 
295 300 305 

10 

GGC GAC CCC GTC GAG GCG GCC GCA CTG GGC GCC GTG CTC GGG GCG GGG 1315 
Gly Asp Pro Val Glu Ala Ala Ala Leu Gly Ala Val Leu Gly Ala Gly 
310 315 320 

GCG GAC AGC GGC CGC AGC ACG CCG CTC GCC GTC GGC TCG GTG AAG ACC 1363 
15 Ala Asp Ser Gly Arg Ser Thr Pro Leu Ala Val Gly Ser Val Lys Thr 

325 330 335 

AAC GTC GGC CAT CTG GAG GGC GCG GCG GGC ATC GTC GGA CTG ATC AAG 1411 
Asn Val Gly His Leu Glu Gly Ala Ala Gly lie Val Gly Leu He Lys 
20 3 4 0 3 4 5 3 5 0 

GCC ACG CTG TGC GTA CGG AAG GGC GAA CTC GTC CCC AGC CTC AAC TTC 1459 
Ala Thr Leu Cys Val Arg Lys Gly Glu Leu Val Pro Ser Leu Asn Phe 
355 360 365 370 

25 AGC ACG CCG AAC CCT GAC ATC CCC CTC GAC GAC CTG CGG CTG CGC GTC 1507 

Ser Thr Pro Asn Pro Asp He Pro Leu Asp Asp Leu Arg Leu Arg Val 
375 380 385 

CAG ACC GAA CGG CAG GAG TGG AAC GAG GAG GAC GAC CGG CCG CGC GTG 1555 
^ Gin Thr Glu Arg Gin Glu Trp Asn Glu Glu Asp Asp Arg Pro Arg Val 

390 395 400 

GCC GGC GTC TCC TCC TTC GGT ATG GGC GGA ACC AAT GTC CAC CTC GTG 1603 
Ala Gly Val Ser Ser Phe Gly Met Gly Gly Thr Asn Val His Leu Val 
405 410 415 

35 

ATC GCG GAG GCT CCG GCC GCG GCG GGG TCC TCC GGG GCG GGG GGT TCG 1651 
He Ala Glu Ala Pro Ala Ala Ala Gly Ser Ser Gly Ala Gly Gly Ser 
420 425 430 

40 GGC GCT GGT TCC GGT GCC GGT ATC AGC GCT GTT TCT GGT GTG GTG CCG 1699 

Gly Ala Gly Ser Gly Ala Gly He Ser Ala Val Ser Gly Val Val Pro 
435 440 445 450 



45 



GTG GTG GTT TCG GGG CGT TCG CGG GTG GTG GTG CGG GAG GCT GCG GGC 1747 
Val Val Val Ser Gly Arg Ser Arg Val Val Val Arg Glu Ala Ala Gly 
455 460 465 



CGG TTG GCG GAG GTG GTG GAG GCC GGT GGT GTG GGG CTG GCG GAT GTG 1795 
Arg Leu Ala Glu Val Val Glu Ala Gly Gly Val Gly Leu Ala Asp Val 
470 475 480 

so 

GCG GTG ACG ATG GCG GAC CGG TCG CGG TTT GGG TAT CGG GCG GTT GTG 1843 
Ala Val Thr Met Ala Asp Arg Ser Arg Phe Gly Tyr Arg Ala Val Val 
485 490 495 

55 CTG GCT CGG GGT GAG GCT GAG CTT GCC GGG CGT TTG CGG GCG TTG GCG 1891 

Leu Ala Arg Gly Glu Ala Glu Leu Ala Gly Arg Leu Arg Ala Leu Ala 
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500 505 510 

GGG GOT GAT CCG GAC GCG GGT GTG GTC ACC GGT GCG GTT CTC GAC GGT 1939 
Gly Gly Asp Pro Asp AlaGly Val Val Thr Gly Ala Val Leu Asp Gly 
515 520 525 530 

GGT GTG GTT GTC GGT GCT GCC CCC GGC GGT GCC GGT GCT GCC GGT GGT 1987 
Gly Val Val Val Gly Ala Ala Pro Gly Gly Ala Gly Ala Ala Gly Gly 
535 540 545 

GCC GGT GCT GCC GGT GGT GCC GGT GGT GGG GGC GTG GTG TTG GTT TTC 2035 
Ala Gly Ala Ala Gly Gly Ala Gly Gly Gly Gly Val Val Leu Val Phe 
550 555 560 

CCT GGT CAG GGG ACG CAG TGG GTG GGG ATG GGT GCG GGG CTG CTC GGG 2083 
Pro Gly Gin Gly Thr Gin Trp Val Gly Met Gly Ala Gly Leu Leu Gly 
565 570 575 

TCT TCG GAG GTG TTT GCG GCG TCG ATG CGG GAG TGT GCG CGG GCG CTG 2131 
Ser Ser Glu Val Phe Ala Ala Ser Met Arg Glu Cys Ala Arg Ala Leu 
580 585 590 

AGT GTT CAT GTG GGG TGG GAT TTG CTG GAG GTG GTG TCG GGC GGG GCC 2179 
Ser Val His Val Gly Trp Asp Leu Leu Glu Val Val Ser Gly Gly Ala 
595 600 60S 610 

25 GGG TTG GAG CGG GTG GAT GTG GTG CAG CCG GTC ACG TCG GCG GTG ATG 2227 

Gly Leu Glu Arg Val Asp Val Val Gin Pro Val Thr Trp Ala Val Met 
615 620 625 
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GTG TCG CTC GCC CGG TAC TCG CAG GCG ATG GGT GTG GAC GTG GCT GCG 2275 
Val Ser Leu Ala Arg Tyr Trp Gin Ala Met Gly Val Asp Val Ala Ala 
630 635 640 

GTG GTC GGT CAT TCC CAG GGG GAG ATC GCC GCT GCC ACG GTG GCG GGG 2323 
Val Val Gly His Ser Gin Gly Glu He Ala Ala Ala Thr Val Ala Gly 
645 650 655 

GCG TTC TCG CTC GAG GAT GCG GCG GCT GTC GTC GCT CTC CGG GCG GGG 2371 
Ala Leu Ser Leu Glu Asp Ala Ala Ala Val Val Ala Leu Arg Ala Gly 
660 665 670 

TTC ATT GGC CGG TAT CTC GCG GGT CGT GGT GCG ATC GCG GCT GTT CCG 2419 
Leu He Gly Arg Tyr Leu Ala Gly Arg Gly Ala Met Ala Ala Val Pro 
675 680 685 690 

CTC CCT GCC GGC GAG GTC GAG GCC GGG CTC GCG AAG TCG CCG GGT GTC 2467 
Leu Pro Ala Gly Glu Val Glu Ala Gly Leu Ala Lys Trp Pro Gly Val 
695 700 705 

GAG GTC GCG GCG GTC AAC GGT CCG GCG TCT ACG GTG GTT TCC GGG GAT 2515 
Glu Val Ala Ala Val Asn Gly Pro Ala Ser Thr Val Val Ser Gly Asp 
710 715 720 

CGG CGG GCG GTC GCC GGT TAT GTC GCC GTC TGT CAG GCG GAG GGT GTC 2563 
Arg Arg Ala Val Ala Gly Tyr Val Ala Val Cys Gin Ala Glu Gly Val 
725 730 735 

CAG GCC CGG TTG ATA CCG GTC GAC TAC GCC TCT CAC TCC CGC CAT GTG 2611 
Gin Ala Arg Leu He Pro Val Asp Tyr Ala Ser His Ser Arg His Val 
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740 745 750 

GAG GAC CTG AAG GGC GAG TTG GAG CGG GTG CTG TCC GGT ATC CGC CCC 2659 
Glu Asp Leu Lys Gly Glu Leu Glu Arg Val Leu Ser Gly lie Arg Pro 
755 760 765 770 

CGC AGT CCG CGG GTG CCG GTG TGT TCC ACC GTC GCC GGA GAG CAG CCG 2707 
Arg Ser Pro Arg Val Pro Val Cys Ser Thr Val Ala Gly Glu Gin Pro 
775 780 785 

GGC GAG CCG GTT TTC GAT GCG GGG TAT TGG TTC CGT AAT CTG CGG AAC 2755 
Gly Glu Pro Val Phe Asp Ala Gly Tyr Trp Phe Arg Asn Leu Arg Asn 
790 795 ~ 800 

CGG GTT GAG TTC TCC GCG GTG GTC GGT GGT TTG TTG GAG GAG GGC CAC 2803 
Arg Val Glu Phe Ser Ala Val Val Gly Gly Leu Leu Glu Glu Gly His 
805 810 815 

CGT CGG TTC ATC GAG GTC AGT GCC CAC CCG GTA CTC GTC CAT GCG ATC 2851 
Arg Arg Phe He Glu Val Ser Ala His Pro Val Leu Val His Ala He 
820 825 830 

GAG CAG ACG GCC GAG GCC GCG GAC CGG AGT GTC CAT GCC ACC GGG ACC 2899 
Glu Gin Thr Ala Glu Ala Ala Asp Arg Ser Val His Ala Thr Gly Thr 
835 840 845 850 

25 CTG CGC CGC CAG GAC GAC AGC CCG CAC CGC CTG CTG ACC TCC ACC GCC 2947 

Leu Arg Arg Gin Asp Asp Ser Pro His Arg Leu Leu Thr Ser Thr Ala 
855 860 865 

GAG GCC TGG GCC CAC GGC GCC ACC CTC ACC TGG GAC CCC GCC CTG CCC 2995 
30 Glu Ala Trp Ala His Gly Ala Thr Leu Thr Trp Asp Pro Ala Leu Pro 

870 875 880 

CCA GGC CAC CTC ACC ACC CTC CCC ACC TAC CCC TTC AAC CAC CAC CAC 3043 

Pro Gly His Leu Thr Thr Leu Pro Thr Tyr Pro Phe Asn His His His 

885 890 895 

35 

TAC TGG CTC GAC ACC ATT GAC GGG GGC GGA GGG GAC GAC GCG ACC CAG 3091 

Tyr Trp Leu Asp Thr He Asp Gly Gly Gly Gly Asp Asp Ala Thr Gin 

900 905 910 

40 GAG AAG GAG AGC GGC CCT CTG ACG CGG GAA CTG CGT GGG CTG CCG TCC 3139 

Glu Lys Glu Ser Gly Pro Leu Thr Arg Glu Leu Arg Gly Leu Pro Ser 
915 920 ' 925 930 

TCT CAG AAG CAA CTG GGT TTC CTG CTC GAT CTG GTG TGC CGG CAC ACG 3187 
Ser Gin Lys Gin Leu Gly Phe Leu Leu Asp Leu Val Cys Arg His Thr 
935 940 945 

GCC GTC GTA CTC GGC CTG GAC ACG GCC GCC GAG GTG GAC CCG GAC CTG 3235 
Ala Val Val Leu Gly Leu Asp Thr Ala Ala Glu Val Asp Pro Asp Leu 
950 955 960 

TCC TTC AAG AAG CAG GGC ATC CAG TCC ATG ACC GGC GTC GAG CTG CGC 3283 
Ser Phe Lys Lys Gin Gly He Gin Ser Met Thr Gly Val Glu Leu Arg 
965 970 975 

AAC AGG CTG CTG ACC GAG ACC GGC CTG GCA TTG CCC ACC ACC CTC GTC 3331 
Asn Arg Leu Leu Thr Glu Thr Gly Leu Ala Leu Pro Thr Thr Leu Val 
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980 985 990 

TAC GAC CGG CCC ACC CCT CGC GCC CTG GCG CAG TTC CTC CAC ACC GAG 3379 
Tyr Asp Arg Pro Thr Pro Arg Ala Leu Ala Gin Phe Leu His Thr Glu 
995 1000 1005 1010 

TTG CTC GAC GGC TCC CCC TCG GGC TCC GTC CTC GCA CCG GCG CAG AAG 3427 
Leu Leu Asp Gly Ser Pro Ser Gly Ser Val Leu Ala Pro Ala Gin Lys 
1015 1020 1025 

AGC TTC GAG GCC GGC GGG CCC GGA GTG CTC TCG TCG GCC GCG GTA GGG 3475 
Ser Phe Glu Ala Gly Gly Pro Gly Val Leu Ser Ser Ala Ala Val Gly 
1030 1035 1040 

GTG TCG GAC GCC CGG GGC GGC AGC CGG GAC GAC GAC GAC CCG ATC GCC 3523 
Val Ser Asp Ala Arg Gly Gly Ser Arg Asp Asp Asp Asp Pro He Ala 
1045 1050 1055 

ATC GTG GGT GTC GGC TGC CGG CTC CCC GGC GGC GTC GAC TCG CGC GCC 3571 
He Val Gly Val Gly Cys Arg Leu Pro Gly Gly Val Asp Ser Arg Ala 
1060 1065 1070 

GCT CTC TGG GAG CTG CTG GAG TCC GGC GCC GAC GCC ATC TCG TCC TTC 3619 
Ala Leu Trp Glu Leu Leu Glu Ser Gly Ala Asp Ala He Ser Ser Phe 
1075 1080 1085 1090 

25 CCC ACC GAC CGC GGC TGG GAC CTC GAC GGG CTG TAC GAC CCC GAG CCC 3667 

Pro Thr Asp Arg Gly Trp Asp Leu Asp Gly Leu Tyr Asp Pro Glu Pro 
1095 1100 " 1105 

GGG ACG CCC GGC AAG ACC TAT GTG CGG GAG GGC GGG TTC CTG CAC TCG 3715 
33 Gly Thr Pro Gly Lys Thr Tyr Val Arg Glu Gly Gly Phe Leu His Ser 

1110 1115 1120 

GCG GCC GAG TTC GAC GCG GAG TTC TTC GGG ATA TCG CCG CGC GAG GCC 3763 
Ala Ala Glu Phe Asp Ala Glu Phe Phe Gly He Ser Pro Arg Glu Ala 
H25 1130 1135 

35 

ACG GCC ATG GAC CCG CAG CAG CGC TTG CTG CTG GAA GCG TCG TGG GAG 3811 
Thr Ala Met Asp Pro Gin Gin Arg Leu Leu Leu Glu Ala Ser Trp Glu 
1140 H45 1150 

40 GCC CTC GAG GAC GCC GGA GTG CTC CCC GAG TCA CTG CGC GGC GGC GAC 3859 

Ala Leu Glu Asp Ala Gly Val Leu Pro Glu Ser Leu Arg Gly Gly Asp 
1155 H60 1165 ~ 1170 

GCC GGA GTG TTC GTC GGC GCC ACC GCA CCG GAG TAC GGG CCG AGG CTT 3907 
Ala Gly Val Phe Val Gly Ala Thr Ala Pro Glu Tyr Gly Pro Arg Leu 
1175 H80 1185 

CAC GAG GGA GCG GAC GGA TAC GAG GGG TAC CTG CTC ACC GGC ACC ACC 3955 
His Glu Gly Ala Asp Gly Tyr Glu Gly Tyr Leu Leu Thr Gly Thr Thr 
1190 1195 1200 

GCG AGC GTG GCC TCC GGC CGG ATC GCC TAC ACC CTC GGC ACC GGC GGA 4003 
Ala Ser Val Ala Ser Gly Arg He Ala Tyr Thr Leu Gly Thr Gly Gly 
1205 1210 1215 

CCG GCG CTC ACC GTC GAC ACC GCG TGC TCC TCG TCC CTG GTG GCG CTC 4051 
Pro Ala Leu Thr Val Asp Thr Ala Cys Ser Ser Ser Leu Val Ala Leu 
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1460 1465 1470 

CAT CTG ATC ATC GAG GAG GCG CCG GAG TCG GTC GAG GAC ATC GAC GGC 4819 
His Leu He He Glu Glu Ala Pro Glu Trp Val Glu Asp He Asp Gly 
1^75 1480 1485 1490 

GTC GCT GCT CCT GAC CGC GGT ACC GCG GAC GCG GCT GCT CCG TCG CCG 4867 
Val Ala Ala Pro Asp Arg Gly Thr Ala Asp Ala Ala Ala Pro Ser Pro 
1495 1500 1505 

CTG TTG TTG TCC GCG CGG TCG GAG GGG GCG TTG CGG GCG CAG GCG GTG 4915 
Leu Leu Leu Ser Ala Arg Ser Glu Gly Ala Leu Arg Ala Gin Ala Val 
1510 1515 1520 

CGG TTG GGT GAG TAC GTG GAG CGG GTG GGT GCG GAT CCG CGG GAT GTC 4963 
Arg Leu Gly Glu Tyr Val Glu Arg Val Gly Ala Asp Pro Arg Asp Val 
1525 1530 1535 

GCT TAT TCG CTG GCT TCG ACG CGG ACT CTT TTC GAG CAC CGT GCG GTG 5011 
Ala Tyr Ser Leu Ala Ser Thr Arg Thr Leu Phe Glu His Arg Ala Val 
1540 1545 1550 

GTG CCG TGT GGT GGG CGT GGG GAG CTC GTC GCT GCT CTT GGT GGG TTT 5059 
Val Pro Cys Gly Gly Arg Gly Glu Leu Val Ala Ala Leu Gly Gly Phe 
1555 1560 1565 1570 

25 GCT GCC GGG AGG GTG TCT GGG GGT GTG CGG TCC GGG CGG GCT GTG CCG 5107 

Ala Ala Gly Arg Val Ser Gly Gly Val Arg Ser Gly Arg Ala Val Pro 
1575 1580 1585 

GGT GGG GTG GGG GTG TTG TTC ACG GGT CAG GGT GCG CAG TCG GTT GGT 5155 
30 Gly Gly Val Gly Val Leu Phe Thr Gly Gin Gly Ala Gin Trp Val Gly 

1590 1595 1600 

ATG GGG CGT GGG TTG TAT GCG GGG GGT GGG GTG TTT GCG GAG GTG CTG 5203 
Met Gly Arg Gly Leu Tyr Ala Gly Gly Gly Val Phe Ala Glu Val Leu 
1605 1610 1615 

35 

GAT GAG GTG TTG TCG ATC GTC GGG GAG GTG GAT GGT CGG TCG TTG CGG 5251 
Asp Glu Val Leu Ser Met Val Gly Glu Val Asp Gly Arg Ser Leu Arg 
1620 1625 1630 

40 GAT GTG ATC TTC GGC GAC GTC GAC GTG GAC GCG GGT GCC GGG GCT GAT 5299 

Asp Val Met Phe Gly Asp Val Asp Val Asp Ala Gly Ala Gly Ala Asp 
1635 1640 1645 * ' 1650 

GCG GGT GCC GGT GCG GGT GCT GGG GTC GGT TCT GGT TCC GGT TCT GTG 5347 
Ala Gly Ala Gly Ala Gly Ala Gly Val Gly Ser Gly Ser Gly Ser Val 
1655 1660 1665 

GGT GGG TTC TTC GGT CGG ACG GAG TTT GCT CAG CCT GCG CTG TTT GCG 5395 
Gly Gly Leu Leu Gly Arg Thr Glu Phe Ala Gin Pro Ala Leu Phe Ala 
1670 1675 1680 

TTC GAG GTC GCG TTC TTC CGG GCG TTG GAG GCT CGG GGT GTG GAG GTC 5443 
Leu Glu Val Ala Leu Phe Arg Ala Leu Glu Ala Arg Gly Val Glu Val 
1685 1690 1695 

55 TCG GTG GTC TTG GGT CAT TCG GTC GGG GAG GTC GCT GCT GCG TAT GTG 5491 

Ser Val Val Leu Gly His Ser Val Gly Glu Val Ala Ala Ala Tyr Val 
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1700 1705 1710 

GCG GGG GTC TTC TCG TTG GGT GAT GCG GTC COG TTC GTG GTC GCG CGG 5539 
Ala Gly Val Leu Ser Leu Gly Asp Ala Val Arg Leu Val Val Ala Arg 
I 7 " 1720 1725 1730 

GGT GGG TTG ATG GGT GGG TTG CCG GTG GGT GGG GGG ATG TGG TCG GTG 5587 
Gly Gly Leu Met Gly Gly Leu Pro Val Gly Gly Gly Met Trp Ser Val 
1735 1740 1745 

GGG GCG TCG GAG TCG GTG GTG CGG GGG GTT GTT GAG GGG TTG GGG GAG 5635 
Gly Ala Ser Glu Ser Val Val Arg Gly Val Val Glu Gly Leu Gly Glu 
1750 1755 1760 

TGG GTG TCG GTT GCG GCG GTG AAT GGG CCG CGG TCG GTG GTG TTG TCG 5683 
Trp Val Ser Val Ala Ala Val Asn Gly Pro Arg Ser Val Val Leu Ser 
1765 1770 1775 

GGT GAT GTG GGT GTG CTG GAG TCG GTG GTT GCC TCG CTG ATG GGG GAT 5731 
Gly Asp Val Gly Val Leu Glu Ser Val Val Ala Ser Leu Met Gly Asp 
20 I 780 1785 1790 

GGG GTG GAG TGC CGG CGG TTG GAT GTG TCG CAT GGG TTT CAT TCG GTC 5779 
Gly Val Glu Cys Arg Arg Leu Asp Val Ser His Gly Phe His Ser Val 
1795 1800 1805 1810 

25 I" 1 * 5 ATG GAG CCG GTG TTG GGG GAG TTC CGG GGG GTT GTG GAG TCG TTG 5827 

Leu Met Glu Pro Val Leu Gly Glu Phe Arg Gly Val Val Glu Ser Leu 
1615 1820 1825 

GAG TTC GGT CGG GTG CGG CCG GGT GTG GTG GTG GTG TCG GGT GTG TCG 5875 
30 Glu Phe Gly Arg Val Arg Pro Gly Val Val Val Val Ser Gly Val Ser 

1830 1835 1840 

GGT GGG GTG GTG GGT TCG GGG GAG TTG GGG GAT CCG GGG TAT TGG GTG 5923 
Gly Gly Val Val Gly Ser Gly Glu Leu Gly Asp Pro Gly Tyr Trp Val 
35 1845 1850 1855 

CGT CAT GCG CGG GAG GCG GTG CGT TTC GCG GAT GGG GTC GGG GTG GTG 5971 
Arg His Ala Arg Glu Ala Val Arg Phe Ala Asp Gly Val Gly Val Val 
I860 1865 1870 

40 CGT GGT CTG GGT GTG GGG ACG TTG GTG GAG GTG GGT CCG CAT GGG GTG 6019 

Arg Gly Leu Gly Val Gly Thr Leu Val Glu Val Gly Pro His Gly Val 
1875 1 88 0 1885 1890 

CTG ACG GGG ATG GCG GGT GAG TGC CTG GGG GCC GGT GAT GAT GTG GTC 6067 
Leu Thr Gly Met Ala Gly Glu Cys Leu Gly Ala Gly Asp Asp Val Val 
1895 1900 1905 

GTC GTG CCG GCG ATG CGG CGG GGC CGT GCG GAG CGG GAG GTG TTC GAG 6115 
Val Val Pro Ala Met Arg Arg Gly Arg Ala Glu Arg Glu Val Phe Glu 
1910 1915 1920 

GCG GCG CTG GCG ACG GTC TTC ACC CGG GAC GCC GGC CTG GAC GCC ACG 6163 
Ala Ala Leu Ala Thr Val Phe Thr Arg Asp Ala Gly Leu Asp Ala Thr 
1925 1930 1935 

55 CCA CTC CAC ACC GGG AGC ACC GGC CGG CGC ATC GAC CTC CCC ACC TAC 6211 

Ala Leu His Thr Gly Ser Thr Gly Arg Arg lie Asp Leu Pro Thr Tyr 
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1940 1945 1950 

CCC TTC CAA CGC GAC CGC TAC TGG CTG GAC CCC GTT CGC ACC GCC GTG 6259 
Pro Phe Gin Arg Asp Arg Tyr Trp Leu Asp Pro Val Arg Thr Ala Val 
1955 1960 1965 1970 

ACC GGC GTC GAG CCC GCC GGC TCG CCG GCG GAC GCT CGG GCC ACT GAG 6307 
Thr Gly Val Glu Pro Ala Gly Ser Pro Ala Asp Ala Arg Ala Thr Glu 
1975 1980 " 1985 

CGG GGA CGG TCG ACG ACG GCC GGG ATC CGC TAC CGC GTC GCT TGG CAG 6355 
Arg Gly Arg Ser Thr Thr Ala Gly lie Arg Tyr Arg Val Ala Trp Gin 
1990 1995 2000 

CCG GCC GTC GTC GAC CGC GGC AAC CCC GGG CCT GCC GGT CAT GTG CTG 6403 
Pro Ala Val Val Asp Arg Gly Asn Pro Gly Pro Ala Gly His Val Leu 
2005 2010 2015 

CTT CTG GCC CCG GAC GAG GAC ACG GCC GAC TCC GGA CTC GCC CCC GCG 6451 
Leu Leu Ala Pro Asp Glu Asp Thr Ala Asp Ser Gly Leu Ala Pro Ala 
2020 2025 2030 

ATC GCA CGT GAA CTC GCC GTG CGC GGG GCC GAG GTC CAC ACC GTC GCC 6499 
lie Ala Arg Glu Leu Ala Val Arg Gly Ala Glu Val His Thr Val Ala 
2035 2040 2045 2050 

25 GTG CCG GTC GGT ACA GGC CGG GAG GCA GCC GGG GAC CTG TTG CGG GCC 6547 

Val Pro Val Gly Thr Gly Arg Glu Ala Ala Gly Asp Leu Leu Arg Ala 
2055 2060 2065 

GCC GGT GAC GGT GCC GCC CGC AGC ACC CGA GTT CTG TGG CTC GCC CCG 6595 
30 Ala Gly Asp Gly Ala Ala Arg Ser Thr Arg Val Leu Trp Leu Ala Pro 

2070 2075 2080 

GCC GAG CCG GAC GCG GCC GAC GCC GTC GCC CTC GTC CAG GCG CTG GGC 6643 
Ala Glu Pro Asp Ala Ala Asp Ala Val Ala Leu Val Gin Ala Leu Gly 
2085 .2090 2095 

35 

GAG GCG GTA CCC GAA GCC CCG CTC TGG ATC ACC ACC CGT GAG GCG GCG 6691 
Glu Ala Val Pro Glu Ala Pro Leu Trp He Thr Thr Arg Glu Ala Ala 
2100 2105 2110 

40 GCC GTG CGG CCG GAC GAG ACC CCT TCC GTC GGG GGC GCT CAG CTG TGG 6739 

Ala Val Arg Pro Asp Glu Thr Pro Ser Val Gly Gly Ala Gin Leu Trp 
2115 2120 2125 2130 
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GGA CTC GGA CAG GTC GCC GCG CTC GAA CTG GGG CGG CGC TGG GGC GGC 6787 
Gly Leu Gly Gin Val Ala Ala Leu Glu Leu Gly Arg Arg Trp Gly Gly 
2135 2140 2145 

TTG GCG GAC CTG CCC GGG AGT GCG TCG CCC GCG GTG CTC CGT ACG TTC 6835 
Leu Ala Asp Leu Pro Gly Ser Ala Ser Pro Ala Val Leu Arg Thr Phe 
2150 2155 2160 

GTC GGG GCG CTG CTC GCC GGG GGA GAG AAC CAG TTC CCG GTA CGG CCC 6883 
Val Gly Ala Leu Leu Ala Gly Gly Glu Asn Gin Phe Ala Val Arg Pro 
2165 2170 2175 

TCC GGC GTC CAT GTC CGC CGT GTG GTT CCC GCG CCC GTC CCC GTC CCG 6931 
Ser Gly Val His Val Arg Arg Val Val Pro Ala Pro Val Pro Val Pro 
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2180 2185 2190 

GCC TCC GCT CGC ACC GTC ACC ACG GCC CCC GCC ACC GCC GTC GGC GAG 6979 
Ala Ser Ala Arg Thr Val Thr Thr Ala Pro Ala Thr Ala Val Gly Glu 
2195 2200 2205 2210 

GAC GCA CGG AAC GAC ACC TCG GAC GTG GTC GTG CCG GAC GAC CGG TGG 7027 
Asp Ala Arg Asn Asp Thr Ser Asp Val Val Val Pro Asp Asp Arg Trp 
2215 2220 2225 

TCC TCC GGC ACC GTA CTG ATC ACC GGG GGC ACC GGT GCC CTG GGT GCG 7075 
Ser Ser Gly Thr Val Leu He Thr Gly Gly Thr Gly Ala Leu Gly Ala 
2230 2235 2240 

CAG GTC GCC CGC AGG CTC GCC CGG TCG GGC GCC GCG CGT CTG CTC CTG 7123 
Gin Val Ala Arg Arg Leu Ala Arg Ser Gly Ala Ala Arg Leu Leu Leu 
2245 2250 2255 

GTG GGC CGG CGC GGC GCG GCC GGC CCC GGA GTG GGC GAA CTC GTC GAG 7171 
Val Gly Arg Arg Gly Ala Ala Gly Pro Gly Val Gly Glu Leu Val Glu 
2260 2265 2270 

GAG CTG ACG GCG CTC GGT TCC GAA GTG GCC GTC GAG GCC TGC GAC GTC 7219 
Glu Leu Thr Ala Leu Gly Ser Glu Val Ala Val Glu Ala Cys Asp Val 
2275 2280 2285 "* 2290 

25 GCC GAC CGG GAC GCA CTG GCC GCG CTC CTC GCG GGC CTC CCC GAG GAG 7267 

Ala Asp Arg Asp Ala Leu Ala Ala Leu Leu Ala Gly Leu Pro Glu Glu 
2295 2300 ~ 2305 

CGG CCC CTC GTC GCC GTA CTG CAC GCG GCA GGT GTG CTC GAC GAC GGT 7315 
30 Pro Leu Val Ala Val Leu His Ala Al a Gly Val Leu Asp Asp Gly 

2310 2315 2320 

GTG CTC GAC TCG CTC ACC TCC GAC CGG GTG GAC GCC GTA CTG CGG GAC 7363 
Val Leu Asp Ser Leu Thr Ser Asp Arg Val Asp Ala Val Leu Arg Asp 
2325 2330 2335 



10 



is 



20 



35 



40 



45 



SO 



55 



AAG GTC ACC GCC GCC CGT CAC CTG GAC GAG CTG ACC GCG GAC CTT CCG 7411 
Lys Val Thr Ala Ala Arg His Leu Asp Glu Leu Thr Ala Asp Leu Pro 
2340 2345 2350 

CTC GAC GCC TTC GTG CTC TTC TCC TCC ATC GTC GGC GTG TGG GGC AAC 7459 
Leu Asp Ala Phe Val Leu Phe Ser Ser He Val Gly Val Trp Gly Asn 
2355 2360 2365 * 2370 

GGA GGG CAG GCC GTC TAC GCG GCC GCC AAC GCC GCG CTC GAC GCC CTG 7507 
Gly Gly Gin Ala Val Tyr Ala Ala Ala Asn Ala Ala Leu Asp Ala Leu 
2375 2380 2385 

GCG CAG CGG CGC CGG GCC AGG GGA GCC CGT GCC GCC TCG ATC GCC TGG 7555 
Ala Gin Arg Arg Arg Ala Arg Gly Ala Arg Ala Ala Ser He Ala Trp 
2390 2395 2400 

GGG CCG TGG GCC GGT GCC GGA ATG GCC TCC GGA ACG GCG GCG AAG TCC 7603 
Gly Pro Trp Ala Gly Ala Gly Met Ala Ser Gly Thr Ala Ala Lys Ser 
2405 2410 2415 

TTC GAA CGG GAC GGC GTC ACG GCC CTG GAC CCC GAG CGC GCG CTC GAC 7651 
Phe Glu Arg Asp Gly Val Thr Ala Leu Asp Pro Glu Arg Ala Leu Asp 
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2660 2665 2670 

CGT GGC TGG GAC GTC GAG GGG CTG TAC GAC CCG GAG CCG GGG GTG CCG 8419 
Arg Gly Trp Asp Val Glu Gly Leu Tyr Asp Pro Glu Pro Gly Val Pro 
2675 2680 2685 2690 

GGC AAG AGC TAT GTA CGG GAA GGC GGG TTC CTG CGT TCC GCG GCC GAG 8467 
Gly Lys Ser Tyr Val Arg Glu Gly Gly Phe Leu Arg Ser Ala Ala Glu 
2695 2700 2705 

I^TgaJ: GCG GAG TTC TTC GGG ATA TCG CCG CGC GAG GCC ACG GCC ATG 8515 
Phe Asp Ala Glu Phe Phe Gly He Ser Pro Arg Glu Ala Thr Ala Met 
2710 2715 2720 

GAC CCG CAG CAG CGG TTC CTG CTG GAG ACG TCG TGG GAG GCG CTG GAG 8563 
'5 Asp Pro Gin Gin Arg Leu Leu Leu Glu Thr Ser Trp Glu Ala Leu Glu 

2725 2730 2735 

CGG GCC GGC ATC GTT CCG GAC TCG CTG CGC GGC ACC CGG ACC GGT GTC 8611 
Arg Ala Gly He Val Pro Asp Ser Leu Arg Gly Thr Arg Thr Gly Val 
20 2740 2745 2750 

TTC AGC GGC ATC TCC CAG CAG GAC TAC GCG ACC CAG CTG GGG GAC GCC 8659 
Phe Ser Gly He Ser Gin Gin Asp Tyr Ala Thr Gin Leu Gly Asp Ala 
2755 2760 2765 2770 

25 GCC GAC ACC TAC GGC GGG CAT GTG CTC ACG GGG ACC CTC GGC AGT GTG 8707 

Ala Asp Thr Tyr Gly Gly His Val Leu Thr Gly Thr Leu Gly Ser Val 
2775 2780 2785 

ATC TCC GGT CGG GTT GCC TAT GCG TTG GGG TTG GAG GGG CCG GCG CTG 8755 
3Q lie Ser Gly Arg Val Ala Tyr Ala Leu Gly Leu Glu Gly Pro Ala Leu 

2790 2795 2800 

ACG GTG GAC ACG GCG TGT TCG TCG TCG TTG GTG GCG TTG CAT CTG GCG 8803 
Thr Val Asp Thr Ala Cys Ser Ser Ser Leu Val Ala Leu His Leu Ala 
2805 2810 2815 

GTG CAG TCG TTG CGG CGG GGT GAG TGT GAT CTG GCG TTG GCC GGT GGG 8851 
Val Gin Ser Leu Arg Arg Gly Glu Cys Asp Leu Ala Leu Ala Gly Gly 
2820 2825 2830 



35 



40 GTG ACG GTG ATG GCG ACG CCG ACG GTG TTC GTG GAG TTC TCG CGG CAG 

Val Thr Val Met Ala Thr Pro Thr Val Phe Val Glu Phe Ser Arg Gin 
2835 2840 2845 2850 
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CGG GGG CTG GCG GCG GAC GGG CGG TGC AAG GCG TTC GCG GAG GGT GCG 8947 
Arg Gly Leu Ala Ala Asp Gly Arg Cys Lys Ala Phe Ala Glu Gly Ala 
2855 2860 2865 

GAC GGG ACG GCG TGG GCG GAG GGT GTG GGT GTG CTG CTG GTG GAG CGG 8995 
Asp Gly Thr Ala Trp Ala Glu Gly Val Gly Val Leu Leu Val Glu Arg 
2870 2875 2880 

CTT TCC GAC GCG CGC CGC AAC GGT CAT CCG GTG CTC GCG GTG GTG CGG 9043 
Leu Ser Asp Ala Arg Arg Asn Gly His Arg Val Leu Ala Val Val Arg 
2885 2890 2895 

GGC AGT GCG GTC AAT CAG GAC GGT GCG AGC AAT GGG CTG ACG GCG CCG 9091 
Gly Ser Ala Val Asn Gin Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro 
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2900 2905 2910 

AGT GGT CCG GCG CAG CAG CGG GTG ATC CGT GAG GCG CTG GOT GAT GCG 
Ser Gly Pro Ala Gin Gin Arg Val lie Arg Glu Ala Leu Ala Asp Ala 
2915 2920 2925 2930 

GGG CTG GTG CCC GCC GAC GTG GAT GTG GTG GAG GCG CAC GGT ACG GGG 
Gly Leu Val Pro Ala Asp Val Asp Val Val Glu Ala His Gly Thr Gly 
2935 2940 2945 

ACG GCG CTG GGT GAT CCG ATC GAG GCG GGT GCG CTG CTG GCC ACG TAC 
Thr Ala Leu Gly Asp Pro lie Glu Ala Gly Ala Leu Leu Ala Thr Tyr 
2950 2955 2960 

GGG CGG GAG CGG GTC GGC GAT CCG TTG TGG CTC GGG TCG TTG AAG TCG 
Gly Arg Glu Arg Val Gly Asp Pro Leu Trp Leu Gly Ser Leu Lys Ser 
2965 2970 2975 

AAC ATC GGG CAT GCG CAG GCG GCT GCG GGT GTG GGT GGT GTG ATC AAG 
Asn He Gly His Ala Gin Ala Ala Ala Gly Val Gly Gly Val He Lys 
2980 2985 2990 

GTG GTG CAG GGG ATG CGG CAT GGG TCG TTG CCG CGG ACG CTG CAT GTG 
Val Val Gin Gly Met Arg His Gly Ser Leu Pro Arg Thr Leu His Val 
2995 3000 3005 3010 

GAT GCG CCG TCG TCG AAG GTG GAG TGG GCT TCG GGT GCG GTG GAG CTC 
Asp Ala Pro Ser Ser Lys Val Glu Trp Ala Ser Gly Ala Val Glu Leu 
3015 3020 3025 

CTG ACC GAG ACC CGG TCG TGG CCG CGG CGG GTG GAG CGG GTC CGG CGG 
Leu Thr Glu Thr Arg Ser Trp Pro Arg Arg Val Glu Arg Val Arg Arg 
3030 3035 3040 

GCC GCG GTG TCG GCG TTC GGG GTG AGC GGG ACC AAC GCC CAT GTG GTC 
Ala Ala Val Ser Ala Phe Gly Val Ser Gly Thr Asn Ala His Val Val 
3045 3050 3055 

CTG GAG GAA GCG CCG GCG GAG GCC GGG AGC GAG CAC GGG GAC GGC CCT 
Leu Glu Glu Ala Pro Ala Glu Ala Gly Ser Glu His Gly Asp Gly Pro 
3060 3065 3070 

GAA CCT GAG CGG CCC GAC GCG GTC ACG GGT CCG TTC TCG TCG GTC CTT 
Glu Pro Glu Arg Pro Asp Ala Val Thr Gly Pro Leu Ser Trp Val Leu 
3075 3080 3085 * 3090 

TCT GCG CGG TCG GAG GGG GCG TTG CGG GCG CAG GCG GTC CGG TTC CGT 
Ser Ala Arg Ser Glu Gly Ala Leu Arg Ala Gin Ala Val Arg Leu Arg 
3095 3100 3105 

GAG TCT GTC GAG CGG GTC GGT GCG GAT CCG CGG GAT GTC GCG GGG TCG 
Glu Cys Val Glu Arg Val Gly Ala Asp Pro Arg Asp Val Ala Gly Ser 
3H0 3115 3120 

TTC GTG GTC TCG CGT GCG TCG TTC GGT GAG CGT GCG GTC GTG GTG GGC 
Leu Val Val Ser Arg Ala Ser Phe Gly Glu Arg Ala Val Val Val Gly 
3125 3130 3135 

CGG GGG CGT GAG GAG TTC CTG GCG GGT CTC GAT GTG GTC GCT GCC GGG 
Arg Gly Arg Glu Glu Leu Leu Ala Gly Leu Asp Val Val Ala Ala Gly 
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3140 3145 3150 

GCT CCT GTG GGT GTG TCT TCG GGG GCC GGT GCT GTG GTG CGG GGG AGT 
Ala Pro Val Gly Val Ser Ser Gly Ala Gly Ala Val Val Arg Gly Ser 
3155 3160 3165 3170 

GCG GTG CGG GGT CGT GGG GTG GGG GTG TTG TTC ACG GGT CAG GGT GCG 
Ala Val Arg Gly Arg Gly Val Gly Val Leu Phe Thr Gly Gin Gly Ala 
3175 3180 3185 

CAG TGG GTT GGT ATG GGG CGT GGG TTG TAT GCG GGG GGT GGG GTG TTT 
Gin Trp Val Gly Met Gly Arg Gly Leu Tyr Ala Gly Gly Gly Val Phe 
3190 3195 3200 

GCG GAG GTG CTG GAT GAG GTG TTG TCG GTG GTG GGG GAG GTG GAT GGT 
Ala Glu Val Leu Asp Glu Val Leu Ser Val Val Gly Glu Val Asp Gly 
3205 3210 3215 

CGG TCG TTG CGG GAT GTG ATG TTC GCG GAT GCT GAC TCG GTT TTG GGT 
Arg Ser Leu Arg Asp Val Met Phe Ala Asp Ala Asp Ser Val Leu Gly 
3220 3225 3230 

GGG TTG TTG GGT CGG ACG GAG TTT GCT CAG CCT GCG TTG TTT GCG TTC 
Gly Leu Leu Gly Arg Thr Glu Phe Ala Gin Pro Ala Leu Phe Ala Leu 
3235 3240 3245 3250 

GAG GTG GCG TTG TTC CGG GCG TTG GAG GCT CGG GGT GTG GAG GTG TCG 
Glu Val Ala Leu Phe Arg Ala Leu Glu Ala Arg Gly Val Glu Val Ser 
3255 3260 3265 

GTG GTG TTG GGT CAT TCG GTG GGG GAG GTG GCT GCT GCG TAT GTG GCG 
Val Val Leu Gly His Ser Val Gly Glu Val Ala Ala Ala Tyr Val Ala 
3270 3275 3280 

GGG GTG TTG TCG TTG GGT GAT GCG GTG CGG TTG GTG GTG GCG CGG GGT 
Gly Val Leu Ser Leu Gly Asp Ala Val Arg Leu Val Val Ala Arg Gly 
3285 3290 3295 

GGG TTG ATG GGT GGG TTG CCG GTG GGT GGG GGG ATG TGG TCG GTG GGG 
Gly Leu Met Gly Gly Leu Pro Val Gly Gly Gly Met Trp Ser Val Glv 
3300 3305 3310 

GCG TCG GAG TCG GTG GTG CGG GGG GTT GTT GAG GGG TTG GGG GAG TGG 
Ala Ser Glu Ser Val Val Arg Gly Val Val Glu Gly Leu Gly Glu Trp 
3315 3320 3325 3330 

GTG TCG GTT GCG GCG GTG AAT GGG CCG CGG TCG GTG GTG TTG TCG GGT 
Val Ser Val Ala Ala Val Asn Gly Pro Arg Ser Val Val Leu Ser Gly 
3335 3340 3345 

GAT GTG GGT GTG CTG GAG TCG GTG GTT GTC ACG CTG ATG GGG GAT GGG 
Asp Val Gly Val Leu Glu Ser Val Val Val Thr Leu Met Gly Asp Glv 
3350 3355 3360 

GTG GAG TGC CGG CGG TTG GAT GTG TCG CAT GGG TTT CAT TCG GTG TTG 
Val Glu Cys Arg Arg Leu Asp Val Ser His Gly Phe His Ser Val Leu 
3365 3370 3375 

ATG GAG CCG GTG TTG GGG GAG TTC CGG GGG GTT GTG GAG TCG TTG GAG 
Met Glu Pro Val Leu Gly Glu Phe Arg Gly Val Val Glu Ser Leu Glu 



192 



EP 0 791 655 A2 



3380 



3385 



3390 



10 



15 



TTC GGT CGG GTG CGG COG GGT GTG GTG GTG GTG TCG GGT GTG TCG GGT 10579 
Phe Gly Arg Val Arg Pro Gly Val Val Val Val Ser Gly Val Ser Gly 
3395 3400 3405 3410 

GGG GTG GTG GGT TCG GGG GAG TTG GGG GAT CCG GGG TAT TCG GTG CGT 10627 
Gly Val Val Gly Ser Gly Glu Leu Gly Asp Pro Gly Tyr Trp Val Arg 
3415 .v- 3420 3425 

CAT GCG CGG GAG GCG GTG CGT TTC GCG GAT GGG GTG GGG GTG GTG CGT 10675 
His Ala Arg Glu Ala Val Arg Phe Ala Asp Gly Val Gly Val Val Arg 
3430 3435 3440 

GGT CTG GGT GTG GGG ACG TTG GTG GAG GTG GGT CCG CAT GGG GTG CTG 10723 
Gly Leu Gly Val Gly Thr Leu Val Glu Val Gly Pro His Gly Val Leu 
3445 3450 3455 
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ACG GGG ATG GCG GGT CAG TGC CTG GAG GCC GGT GAT GAT GTG GTG GTG 10771 
Thr Gly Met Ala Gly Gin Cys Leu Glu Ala Gly Asp Asp Val Val Val 
3460 3465 3470 

GTG CCG GCG ATG CGG CGG GGC CGT CCG GAG CGG GAG GTG TTC GAG GCG 10819 
Val Pro Ala Met Arg Arg Gly Arg Pro Glu Arg Glu Val Phe Glu Ala 
3475 3480 3485 3490 

GCG CTG GCG ACG GTG TTC ACC CGG GAC GCC GGC CTC GAC GCC ACG ACA 10867 
Ala Leu Ala Thr Val Phe Thr Arg Asp Ala Gly Leu Asp Ala Thr Thr 
3495 3500 3505 

CTC CAC ACC GGG AGC ACC GGC CGA CGC ATC GAC CTC CCC ACC TAC CCC 10915 
Leu His Thr Gly Ser Thr Gly Arg Arg He Asp Leu Pro Thr Tyr Pro 
3510 3515 3520 

TTC CAA CAC AAC CGC TAC TGG GCA ACC GGC TCA GTG ACC GGT GCG ACC 10963 
Phe Gin His Asn Arg Tyr Trp Ala Thr Gly Ser Val Thr Gly Ala Thr 
3525 3530 3535 

GGC ACC TCG GCA GCC GCG CGC TTC GGC CTG GAG TGG AAG GAC CAC CCC 11011 
Gly Thr Ser Ala Ala Ala Arg Phe Gly Leu Glu Trp Lys Asp His Pro 
3540 3545 3550 

TTC CTC AGC GGC GCC ACG CCG ATA GCC GGC TCC GGC GCG CTG CTC CTC 11059 
Phe Leu Ser Gly Ala Thr Pro He Ala Gly Ser Gly Ala Leu Leu Leu 
3555 3560 3565 3570 

ACC GGC AGG GTG GGG CTC GCT GCC CAC CCG TGG CTG GCC GAC CAC GCC 11107 
Thr Gly Arg Val Gly Leu Ala Ala His Pro Trp Leu Ala Asp His Ala 
3575 3580 3585 

ATC TCC GGC ACG GTG CTG CTC CCC GGA ACG GCG ATC GCC GAC CTG CTG 11155 
lie Ser Gly Thr Val Leu Leu Pro Gly Thr Ala He Ala Asp Leu Leu 
3590 3595 3600 

CTG CGG GCG GTC GAG GAG GTC GGC GCC GGA GGG GTC GAG GAA CTG ACG 11203 
Leu Arg Ala Val Glu Glu Val Gly Ala Gly Gly Val Glu Glu Leu Thr 
3605 3610 3615 

CTC CAT GAG CCC CTG CTC CTC CCC GAG CGA GGC GGC CTG CAC GTC CAG 11251 
Leu His Glu Pro Leu Leu Leu Pro Glu Arg Gly Gly Leu His Val Gin 
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GTG CTG GTC GAG GCG GCC GAC GAG CAG GGA CGG CGT GCC GTG GCA GTC 
Val Leu Val Glu Ala Ala Asp Glu Gin Gly Arg Arg Ala Val Ala Val 
3635 3640 3645 3650 



11299 



GCC GCA CGC CCG GAG GGC CCT GGG CGG GAC GGT GAG GAA CAG GAG TGG 11347 
Ala Ala Arg Pro Glu Gly Pro Gly Arg Asp Gly Glu Glu Gin Glu Trp 
3655 3660 3665 

ACC CGG CAC GCG GAA GGC GTG CTC ACC TCC ACC GAG ACG GCC GTT CCG 11395 
Thr Arg His Ala Glu Gly Val Leu Thr Ser Thr Glu Thr Ala Val Pro 
3670 3675 3680 



15 



20 



GAC ATG GGC TGG GCC GCC GGG GCC TGG CCG CCG CCC GGT GCC GAG CCG 
Asp Met Gly Trp Ala Ala Gly Ala Trp Pro Pro Pro Gly Ala Glu Pro 
3685 3690 3695 



11443 



ATC GAC GTC GAG GAG CTG TAC GAC GCG TTC GCC GCG GAC GGC TAC GGC 11491 
He Asp Val Glu Glu Leu Tyr Asp Ala Phe Ala Ala Asp Gly Tyr Gly 
3700 3705 3710 

TAC GGC CCG GCC TTC ACC GCA CTG TCC GGC GTG TGG CGT CTC GGC GAC 11539 
Tyr Gly Pro Ala Phe Thr Ala Leu Ser Gly Val Trp Arg Leu Gly Asp 
3715 3720 3725 3730 

25 GAA CTC TTC GCC GAG GTG CGG CGG CCC GCG GGG GGC GCG GGC ACG ACC 11587 

Glu Leu Phe Ala Glu Val Arg Arg Pro Ala Gly Gly Ala Gly Thr Thr 
3735 3740 3745 

GGT GAC GGT TTC GGC GTC CAC CCC GCA CTC TTC GAT GCG GCC CTC CAC 11635 
3Q Gly Asp Gly Phe Gly Val His Pro Ala Leu Phe Asp Ala Ala Leu His 

3750 3755 3760 

CCG TGG CGC GCC GGC GGG CTG CTG CCC GAC ACG GGC GGC ACC ACC TGG 11683 
Pro Trp Arg Ala Gly Gly Leu Leu Pro Asp Thr Gly Gly Thr Thr Trp 
3765 3770 3775 

35 

GCG CCG TTC TCC TGG CAG GGC ATC GCG CTC CAC ACC ACC GGA GCC GAG 11731 
Ala Pro Phe Ser Trp Gin Gly He Ala Leu His Thr Thr Gly Ala Glu 
3780 3785 3790 



40 



45 



SO 



55 



ACG CTC CGC GTC AGA CTG GCC CCT GCG GCC GGC GGC ACC GAG TCG GCC 11779 
Thr Leu Arg Val Arg Leu Ala Pro Ala Ala Gly Gly Thr Glu Ser Ala 
3795 3800 3805 3810 

TTC TCC GTA CAG GCC GCC GAC CCG GCG GGC ACC CCG GTC CTC ACC CTC 11827 
Phe Ser Val Gin Ala Ala Asp Pro Ala Gly Thr Pro Val Leu Thr Leu 
3815 3820 3825 

GAC GCA CTG CTG CTC CGC CCG GTG ACC CTG GGG AGG GCC GAC GCG CCG 11875 
Asp Ala Leu Leu Leu Arg Pro Val Thr Leu Gly Arg Ala Asp Ala Pro 
3830 3835 3840 

CAA CCG CTG TAC CGC GTC GAC TGG CAG CCG GTC GGC CAG GGG ACC GAG 11923 
Gin Pro Leu Tyr Arg Val Asp Trp Gin Pro Val Gly Gin Gly Thr Glu 
3845 3850 3855 

GCC TCC GGC GCC CAG GGC TGG ACG GTG CTC GGG CAG GCC GCG GCC GAG 11971 
Ala Ser Gly Ala Gin Gly Trp Thr Val Leu Gly Gin Ala Ala Ala Glu 
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3860 



3865 



3870 



w 



15 



ACG GTC GCG CAG CCC GCC GCC CAT GCG GAC CTC ACC GCC CTG CGT ACG 
Thr Val Ala Gin Pro Ala Ala His Ala Asp Leu Thr Ala Leu Arg Thr 
3875 3880 3885 3890 



GCT CGG GCC CGT GCG GGT GAC GGC TGG GAC GAC GAT CCC CTA CGT GTC 
Ala Arg Ala Arg Ala Gly Asp Gly Trp Asp Asp Asp Pro Leu Arg Val 
3925 3930 3935 



12019 



GCT GTG GCC GCG GCG GGA ACA CCC GTG CCC CGG CTG GTG GTC GTG TCG 12067 
Ala Val Ala Ala Ala Gly Thr Pro Val Pro Arg Leu Val Val Val Ser 
3895 3900 3905 

CCG GTG GAC ACC CGG CTG GAC GAG GGG CCG GTG CTG GCG GAC GCC GAG 12115 
Pro Val Asp Thr Arg Leu Asp Glu Gly Pro Val Leu Ala Asp Ala Glu 
3910 3915 3920 



12163 



20 



35 



40 



45 



50 



55 



GCC CTC GGG CGC GGC CTG ACC CTG GTC CGG GAG TGG GTC GAG GAC GAA 12211 
Ala Leu Gly Arg Gly Leu Thr Leu Val Arg Glu Trp Val Glu Asp Glu 
3940 3945 3950 

CGG TTG GCG GAC TCC CGG CTC GTC GTC CTC ACC CGT GGC GCG GTG GCG 12259 
Arg Leu Ala Asp Ser Arg Leu Val Val Leu Thr Arg Gly Ala Val Ala 
3955 3960 3965 3970 



25 GCC GGT CCC GGC GAT GTG CCG GAC CTG ACA GGT GCG GCC CTG TGG GGG 

Ala Gly Pro Gly Asp Val Pro Asp Leu Thr Gly Ala Ala Leu Trp Gly 
3975 3980 3985 

CTG CTC CGC TCC GCG CAG TCG GAG TAT CCG GAC CGC TTC ACC CTC ATC 
30 Leu Leu Arg Ser Ala Gin Ser Glu Tyr Pro Asp Arg Phe Thr Leu lie 

3990 3995 4000 



12307 



12355 



GAC GTG GAC GAT TCC CCC GAG TCC CGT GCG GCT CTG CCC CGG GCT CTG 12403 
Asp Val Asp Asp Ser Pro Glu Ser Arg Ala Ala Leu Pro Arg Ala Leu 
4005 4010 4015 

GGA TCG GCC GAG CGA CAA CTC GCC CTG CGG ACG GGC GAC GTG CTG GCG 12451 
Gly Ser Ala Glu Arg Gin Leu Ala Leu Arg Thr Gly Asp Val Leu Ala 
4020 4025 4030 

CCG GCC CTG GTC CCG ATG GCC ACC CGG CCG GCG GAG ACC ACT CCA GCG 12499 
Pro Ala Leu Val Pro Met Ala Thr Arg Pro Ala Glu Thr Thr Pro Ala 
4035 4040 4045 4050 

ACG GCG GTC GCC TCG GCG ACA ACA CAG ACA CAG GTC ACC GCG CCC GCT 12547 
Thr Ala Val Ala Ser Ala Thr Thr Gin Thr Gin Val Thr Ala Pro Ala 
4055 4060 4065 

CCC GAC GAC CCG GCT GCG GAT GCC GTG TTC GAC CCG GCG GGC ACC GTA 12595 
Pro Asp Asp Pro Ala Ala Asp Ala Val Phe Asp Pro Ala Gly Thr Val 
4070 4075 4080 

CTG ATC ACC GGC GGC ACC GGC GCC CTG GGA CGG CGT GTC GCC TCG CAC 12643 
Leu lie Thr Gly Gly Thr Gly Ala Leu Gly Arg Arg Val Ala Ser His 
4085 4090 4095 



CTC GCG CGC CGG TAC GGC GTA CGC CAC ATG CTT CTG GTC AGC AGG CGT 
Leu Ala Arg Arg Tyr Gly Val Arg His Met Leu Leu Val Ser Arg Arg 



12691 
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4100 



4105 



4110 



10 



is 



20 



25 



30 



35 



40 



45 



SO 



SS 



GGA CCG GAC GCC CCC GAG GCC GGT CCC CTG GAA CGG GAA CTC GCC GGT 12739 
Gly Pro Asp Ala Pro Glu Ala Gly Pro Leu Glu Arg Glu Leu Ala Gly 
4115 4120 4125 4130 

CTC GGA GTC ACC GCC ACC TTC CTG GCA TGC GAC CTC ACC GAC ATC GAG 12787 
Leu Gly Val Thr Ala Thr Phe Leu Ala Cys Asp Leu Thr Asp He Glu 
4135 4140 4145 

GCC GTA CGG AAG GCC GTC GCC GCG GTG CCG TCG GAC CAC CCG CTG ACC 12835 
Ala Val Arg Lys Ala Val Ala Ala Val Pro Ser Asp His Pro Leu Thr 
4150 4155 4160 

GGT GTG GTG CAC ACC GCC GGC GTG CTG GAC GAC GGC GCC CTG ACC GGC 12883 
Gly Val Val His Thr Ala Gly Val Leu Asp Asp Gly Ala Leu Thr Gly 
4165 4170 4175 

CTG ACC CGG CAA CGC CTC GAC ACC GTG CTG CGG CCC AAG GCC GAC GCC 12931 
Leu Thr Arg Gin Arg Leu Asp Thr Val Leu Arg Pro Lys Ala Asp Ala 
4180 4185 4190 

GTG CGG AAC CTC CAC GAG GCG ACC CTC GAC CGG CCG CTG CGC GCG TTC 12979 
Val Arg Asn Leu His Glu Ala Thr Leu Asp Arg Pro Leu Arg Ala Phe 
4195 4200 4205 4210 

GTC CTG TTC TCC GCC GCC GCC GGA CTC CTG GGC CGC CCC GGG CAG GCC 13027 
Val Leu Phe Ser Ala Ala Ala Gly Leu Leu Gly Arg Pro Gly Gin Ala 
4215 4220 ' 4225 

TCC TAC GCC GCC GCC AAC GCG GTC CTC GAC GCG CTC GCG GGA GCC CGC 13075 
Ser Tyr Ala Ala Ala Asn Ala Val Leu Asp Ala Leu Ala Gly Ala Arg 
4230 4235 4240 

CGC GCG GCC GGA CTG CCC GCA GTG TCC CTG GCG TGG GGC CTC TCG GAC 13123 
Arg Ala Ala Gly Leu Pro Ala Val Ser Leu Ala Trp Gly Leu Trp Asp 
4245 4250 4255 

GAG CAG ACG GGC ATC GCA GGA GGC CTC GAC GAG ATC GCC CTG CGC GTC 13171 
Glu Gin Thr Gly Met Ala Gly Gly Leu Asp Glu Met Ala Leu Arg Val 
4260 4265 4270 

CTC CGC CGG GAC GGC ATC GCC GCG ATC CCT CCG GAG CAG GGG CTC GAA 13219 
Leu Arg Arg Asp Gly He Ala Ala Met Pro Pro Glu Gin Gly Leu Glu 
4275 4280 4285 4290 

CTC CTC GAC CTC GCC CTC ACC GGA CAC CGG GAC GGA CCC GCC GTC CTC 13267 
Leu Leu Asp Leu Ala Leu Thr Gly His Arg Asp Gly Pro Ala Val Leu 
4295 4300 4305 

GTC CCC CTC CTC CTC GAC GGC GCG GCC CTC CGC CGC ACG GCG AAG GAG 13315 
Val Pro Leu Leu Leu Asp Gly Ala Ala Leu Arg Arg Thr Ala Lys Glu 
4310 4315 4320 

CGC GGC GCG GCC ACG ATC TCC CCC TTC CTG CGC GCC CTG CTC CCC GCC 133 63 

Arg Gly Ala Ala Thr Met Ser Pro Leu Leu Arg Ala Leu Leu Pro Ala 
4325 4330 4335 

GCC CTC CGC CGC AGC GGT GGA GCC GGC GCC CCC GCG GCG GCC GAC CGG 13411 
Ala Leu Arg Arg Ser Gly Gly Ala Gly Ala Pro Ala Ala Ala Asp Arg 
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4340 4345 4350 

CAC GGC AAG GAG GCG GAC CCC GGT GCG GGA CGC CTC GCA GGG ATG GTC 
His Gly Lys Glu Ala Asp Pro Gly Ala Gly Arg Leu Ala Gly Met Val 
4355 4360 4365 4370 

GCA CTC GAA GCG GCG GAG CGT TCC GCG GCC GTC CTT GAG CTC GTC ACC 
Ala Leu Glu Ala Ala Glu Arg Ser Ala Ala Val Leu Glu Leu Val Thr 
4375 4380 4385 

GAA CAG GTC GCC GAG GTC CTC GGC TAC GCG TCG GCC GCG GAG ATC GAG 
Glu Gin Val Ala Glu Val Leu Gly Tyr Ala Ser Ala Ala Glu He Glu 
4390 4395 4400 

CCC GAA CGA CCC TTC CGG GAG ATC GGC GTC GAC TCC CTG GCG GCG GTG 
Pro Glu Arg Pro Phe Arg Glu He Gly Val Asp Ser Leu Ala Ala Val 
4405 4410 4415 

GAG CTG CGC AAC CGG CTC AGC CGT CTG GTC GGC CTG CGG TTC CCG ACC 
Glu Leu Arg Asn Arg Leu Ser Arg Leu Val Gly Leu Arg Leu Pro Thr 
4420 4425 4430 

ACG CTG TCC TTC GAC CAC CCC ACG CCG AAG GAC ATC GCG CAG CAC ATC 
Thr Leu Ser Phe Asp His Pro Thr Pro Lys Asp Met Ala Gin His He 
4435 4440 4445 4450 

GAC GGG CAG CTC CCC CGC CCG GCC GGA GCC TCG CCC GCG GAC GCA GCG 
Asp Gly Gin Leu Pro Arg Pro Ala Gly Ala Ser Pro Ala Asp Ala Ala 
4455 4460 4465 

CTC GAA GGG ATC GGC GAC CTC GCG CGG GCG GTC GCC CTG CTG GGC ACG 
Leu Glu Gly He Gly Asp Leu Ala Arg Ala Val Ala Leu Leu Gly Thr 
4470 4475 4480 

GGC GAC GCC CGC CGG GCC GAG GTA CGA GAG CAG CTC GTC GGA CTG CTC 
Gly Asp Ala Arg Arg Ala Glu Val Arg Glu Gin Leu Val Gly Leu Leu 
4485 4490 4495 

GCC GCG CTC GAC CCA CCT GGG CGG ACG GGC ACC GCC GCA CCC GGC GTC 
Ala Ala Leu Asp Pro Pro Gly Arg Thr Gly Thr Ala Ala Pro Gly Val 
4500 4505 4510 

CCC TCC GGT GCC GAT GGC GCG GAA CCG ACC GTC ACG GAC CGG CTC GAC 
Pro Ser Gly Ala Asp Gly Ala Glu Pro Thr Val Thr Asp Arg Leu Asp 
4515 4520 4525 * ~ 4530 

GAG GCG ACC GAC GAC GAG ATC TTC GCC TTC CTC GAC GAG CAG CTC TGA 
Glu Ala Thr Asp Asp Glu He Phe Ala Phe Leu Asp Glu Gin Leu * 
4535 4540 * 4545 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4546 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE : protein 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Ser Gly Glu Leu Ala He Ser Arg Ser Asp Asp Arg Ser Asp Ala 
15 10 15 

Val Ala Val Val Gly Met Ala Cys Arg Phe Pro Gly Ala Pro Gly He 
20 25 30 

Ala Glu Phe Trp Glu Leu Leu Arg Ser Gly Arg Gly Met Pro Thr Arg 
35 40 45 

Gin Asp Asp Gly Thr Trp Arg Ala Ala Leu Glu Asp His Ala Gly Phe 
50 55 60 

Asp Ala Gly Phe Phe Gly Met Asn Ala Arg Gin Ala Ala Ala Thr Asp 
*5 65 70 75 80 

Pro Gin His Arg Leu Met Leu Glu Leu Gly Trp Glu Ala Leu Glu Asp 
85 90 95 



70 



Ala Gly He Val Pro Gly Asp Leu Thr Gly Thr Asp Thr Gly Val Phe 
100 105 HO 

Ala Gly Val Ala Ser Asp Asp Tyr Ala Val Leu Thr Arg Arg Ser Ala 
115 120 125 

Val Ser Ala Gly Gly Tyr Thr Ala Thr Gly Leu His Arg Ala Leu Ala 
130 135 140 

Ala Asn Arg Leu Ser His Phe Leu Gly Leu Arg Gly Pro Ser Leu Val 
145 150 155 * 160 

Val Asp Ser Ala Gin Ser Ala Ser Leu Val Ala Val Gin Leu Ala Cys 
165 170 175 

Glu Ser Leu Arg Arg Gly Glu Thr Ser Leu Ala Val Ala Gly Gly Val 
180 185 190 

Asn Leu He Leu Thr Glu Glu Ser Thr Thr Val Met Glu Arg Met Gly 
195 200 205 

Ala Leu Ser Pro Asp Gly Arg Cys His Thr Phe Asp Ala Arg Ala Asn 
210 215 220 

Gly Tyr Val Arg Gly Glu Gly Gly Gly Ala Val Val Leu Lys Pro Leu 
225 230 235 " 240 

Asp Ala Ala Leu Ala Asp Gly Asp Arg Val Tyr Cys Val He Lys Gly 
45 245 250 255 

Gly Ala Val Asn Asn Asp Gly Gly Gly Ala Ser Leu Thr Thr Pro Asp 
260 265 270 

so Glu Ala Gin Glu Ala Val Leu Arg Gin Ala Tyr Arg Arg Ala Gly 

275 280 285 

Val Ser Thr Gly Ala Val Arg Tyr Val Glu Leu His Gly Thr Gly Thr 
290 295 300 



20 



25 



30 



35 



40 



55 



Arg Ala Gly Asp Pro Val Glu Ala Ala Ala Leu Gly Ala Val Leu Gly 
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305 



310 



315 



320 



Ala Gly Ala Asp Ser Gly Arg Ser Thr Pro Leu Ala Val Gly Ser Val 
325 330 335 

Lys Thr Asn Val Gly His Leu Glu Gly Ala Ala Gly lie Val Gly Leu 
340 345 350 



10 



He Lys Ala Thr Leu Cys Val Arg Lys Gly Glu Leu Val Pro Ser Leu 
355 360 ~ 365 



Asn Phe Ser Thr Pro Asn Pro Asp He Pro Leu Asp Asp Leu Arg Leu 
370 375 380 



75 



Arg Val Gin Thr Glu Arg Gin Glu Trp Asn Glu Glu Asp Asp Arg Pro 
385 390 395 400 



Arg Val Ala Gly Val Ser Ser Phe Gly Met Gly Gly Thr Asn Val His 
405 410 415 



20 



Leu Val He Ala Glu Ala Pro Ala Ala Ala Gly Ser Ser Gly Ala Gly 
420 425 430 



Gly Ser Gly Ala Gly Ser Gly Ala Gly He Ser Ala Val Ser Gly Val 
435 440 445 



25 



Val Pro Val Val Val Ser Gly Arg Ser Arg Val Val Val Arg Glu Ala 
450 455 460 



30 



Ala Gly Arg Leu Ala Glu Val Val Glu Ala Gly Gly Val Gly Leu Ala 

465 470 475 480 

Asp Val Ala Val Thr Met Ala Asp Arg Ser Arg Phe Gly Tyr Arg Ala 

485 490 " ' 495 



35 



Val Val Leu Ala Arg Gly Glu Ala Glu Leu Ala Gly Arg Leu Arg Ala 
500 505 510 

Leu Ala Gly Gly Asp Pro Asp Ala Gly Val Val Thr Gly Ala Val Leu 
515 520 525 



40 



Asp Gly Gly Val Val Val Gly Ala Ala Pro Gly Gly Ala Gly Ala Ala 
530 535 540 



Gly Gly Ala Gly Ala Ala Gly Gly Ala Gly Gly Gly Gly Val Val Leu 
545 550 555 560 



45 



Val Phe Pro Gly Gin Gly Thr Gin Trp Val Gly Met Gly Ala Gly Leu 
565 570 575 



Leu Gly Ser Ser Glu Val Phe Ala Ala Ser Met Arg Glu Cys Ala Arg 
580 585 590 



50 



Ala Leu Ser Val His Val Gly Trp Asp Leu Leu Glu Val Val Ser Gly 
595 600 605 



Gly Ala Gly Leu Glu Arg Val Asp Val Val Gin Pro Val Thr Trp Ala 
610 615 620 



55 



Val Met Val Ser Leu Ala Arg Tyr Trp Gin Ala Met Gly Val Asp Val 
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625 



630 



635 



640 



Ala Ala Val Val Gly His Ser Gin Gly Glu He Ala Ala Ala Thr Val 
645 650 655 

Ala Gly Ala Leu Ser Leu Glu Asp Ala Ala Ala Val Val Ala Leu Arg 
660 665 670 



10 



Ala Gly Leu He Gly Arg Tyr Leu Ala Gly Arg Gly Ala Met Ala Ala 
675 680 685 



Val Pro Leu Pro Ala Gly Glu Val Glu Ala Gly Leu Ala Lys Trp Pro 
690 695 700 



15 



20 



Gly Val Glu Val Ala Ala Val Asn Gly Pro Ala Ser Thr Val Val Ser 
705 710 715 720 

Gly Asp Arg Arg Ala Val Ala Gly Tyr Val Ala Val Cys Gin Ala Glu 
725 730 735 

Gly Val Gin Ala Arg Leu He Pro Val Asp Tyr Ala Ser His Ser Arg 
740 745 750 



His Val Glu Asp Leu Lys Gly Glu Leu Glu Arg Val Leu Ser Gly He 
755 760 765 



25 



Arg Pro Arg Ser Pro Arg Val Pro Val Cys Ser Thr Val Ala Gly Glu 
770 775 780 



30 



35 



Gin Pro Gly Glu Pro Val Phe Asp Ala Gly Tyr Trp Phe Arg Asn Leu 
785 790 795 800 

Arg Asn Arg Val Glu Phe Ser Ala Val Val Gly Gly Leu Leu Glu Glu 
805 810 815 

Gly His Arg Arg Phe He Glu Val Ser Ala His Pro Val Leu Val His 
820 825 830 

Ala He Glu Gin Thr Ala Glu Ala Ala Asp Arg Ser Val His Ala Thr 
835 840 845 



40 



Gly Thr Leu Arg Arg Gin Asp Asp Ser Pro His Arg Leu Leu Thr Ser 
8 *0 855 860 



45 



50 



55 



Thr Ala Glu Ala Trp Ala His Gly Ala Thr Leu Thr Trp Asp Pro Ala 
865 870 875 880 

Leu Pro Pro Gly His Leu Thr Thr Leu Pro Thr Tyr Pro Phe Asn His 
885 890 895 

His His Tyr Trp Leu Asp Thr lie Asp Gly Gly Gly Gly Asp Asp Ala 
900 905 910 

Thr Gin Glu Lys Glu Ser Gly Pro Leu Thr Arg Glu Leu Arg Gly Leu 
915 920 925 

Pro Ser Ser Gin Lys Gin Leu Gly Phe Leu Leu Asp Leu Val Cys Ara 
930 935 940 

His Thr Ala Val Val Leu Gly Leu Asp Thr Ala Ala Glu Val Asp Pro 
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945 950 955 960 

Asp Leu Ser Phe Lys Lys Gin Gly He Gin Ser Met Thr Gly Val Glu 
965 970 975 

5 

Leu Arg Asn Arg Leu Leu Thr Glu Thr Gly Leu Ala Leu Pro Thr Thr 
980 985 990 

Leu Val Tyr Asp Arg Pro Thr Pro Arg Ala Leu Ala Gin Phe Leu His 
ro 995 1000 1005 

Thr Glu Leu Leu Asp Gly Ser Pro Ser Gly Ser Val Leu Ala Pro Ala 
1010 1015 1020 

Gin Lys Ser Phe Glu Ala Gly Gly Pro Gly Val Leu Ser Ser Ala Ala 
75 1025 1030 1035 1040 

Val Gly Val Ser Asp Ala Arg Gly Gly Ser Arg Asp Asp Asp Asp Pro 
1045 1050 1055 

20 Ile Ala Ile Val Gl Y Val G ly Cys Arg Leu Pro Gly Gly Val Asp Ser 

1060 1065 1070 

Arg Ala Ala Leu Trp Glu Leu Leu Glu Ser Gly Ala Asp Ala Ile Ser 
1075 1080 " 1085 

25 Se r Phe Pro Thr Asp Arg Gly Trp Asp Leu Asp Gly Leu Tyr Asp Pro 

1090 1095 1100 

Glu Pro Gly Thr Pro Gly Lys Thr Tyr Val Arg Glu Gly Gly Phe Leu 
H05 1110 U15 - * H20 

His Ser Ala Ala Glu Phe Asp Ala Glu Phe Phe Gly Ile Ser Pro Arg 
1125 H30 1135 

Glu Ala Thr Ala Met Asp Pro Gin Gin Arg Leu Leu Leu Glu Ala Ser 
1140 1145 H50 

Trp Glu Ala Leu Glu Asp Ala Gly Val Leu Pro Glu Ser Leu Arg Gly 
1155 1160 1165 

Gly Asp Ala Gly Val Phe Val Gly Ala Thr Ala Pro Glu Tyr Gly Pro 
1170 1175 1180 

Arg Leu His Glu Gly Ala Asp Gly Tyr Glu Gly Tyr Leu Leu Thr Gly 
1185 1190 1195 1200 

Thr Thr Ala Ser Val Ala Ser Gly Arg Ile Ala Tyr Thr Leu Gly Thr 
45 1205 1210 ~ 1215 

Gly Gly Pro Ala Leu Thr Val Asp Thr Ala Cys Ser Ser Ser Leu Val 
1220 1225 1230 

so Ala Leu His Leu Ala Val Gin Ala Leu Arg Arg Gly Glu Cys Gly Leu 

1235 1240 1245 

Ala Leu Ala Gly Gly Ala Thr Val Met Ser Gly Pro Gly Met Phe Val 
1250 1255 1260 



30 



35 



40 



55 



Glu Phe Ser Arg Gin Arg Gly Leu Ala Pro Asp Gly Arg Cys Met Pro 
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1265 1270 1275 1280 

Phe Ser Ala Asp Ala Asp Gly Thr Ala Trp Ser Glu Gly Val Ala Val 
1285 1290 1295 

Leu Ala Leu Glu Arg Leu Ser Asp Ala Arg Arg Ala Gly His Arg Val 
1300 1305 1310 

Leu Gly Val Val Arg Gly Ser Ala Val Asn Gin Asp Gly Ala Ser Asn 
1315 1320 1325 

Gly Leu Thr Ala Pro Asn Arg Ser Ala Gin Glu Gly Val lie Arg Ala 
1330 1335 1340 

Ala Leu Ala Asp Ala Gly Leu Ala Pro Gly Asp Val Asp Ala Val Glu 
1345 1350 1355 1360 

Ala His Gly Thr Gly Thr Ala Leu Gly Asp Pro lie Glu Ala Ser Ala 
1365 1370 1375 

Leu Leu Ala Thr Tyr Gly Arg Glu Arg Val Gly Asp Pro Leu Trp Leu 
1380 1385 1390 

Gly Ser Leu Lys Ser Asn Val Gly His Thr Gin Ala Ala Ala Gly Ala 
1395 1400 1405 

Ala Gly Val Val Lys Met Leu Leu Ala Leu Glu His Gly Thr Leu Pro 
1410 1415 1420 

Arg Thr Leu His Ala Asp Arg Pro Ser Thr His Val Asp Trp Ser Ser 
1425 1430 1435 * 1440 

Gly Thr Val Ala Leu Leu Ala Glu Ala Arg Arg Trp Pro Arg Arg Ser 
1445 1450 1455 

Asp Arg Pro Arg Arg Ala Ala Val Ser Ser Phe Gly He Ser Gly Thr 
1460 1465 1470 

Asn Ala His Leu He He Glu Glu Ala Pro Glu Trp Val Glu Asp He 
1475 1480 " 1485 

Asp Gly Val Ala Ala Pro Asp Arg Gly Thr Ala Asp Ala Ala Ala Pro 
1490 1495 1500 

Ser Pro Leu Leu Leu Ser Ala Arg Ser Glu Gly Ala Leu Arg Ala Gin 
1505 1510 1515 1520 

Ala Val Arg Leu Gly Glu Tyr Val Glu Arg Val Gly Ala Asp Pro Arg 
1525 1530 1535 

Asp Val Ala Tyr Ser Leu Ala Ser Thr Arg Thr Leu Phe Glu His Arg 
1540 1545 1550 

Ala Val Val Pro Cys Gly Gly Arg Gly Glu Leu Val Ala Ala Leu Gly 
1555 1560 1565 

Gly Phe Ala Ala Gly Arg Val Ser Gly Gly Val Arg Ser Gly Arg Ala 
1570 1575 1580 

Val Pro Gly Gly Val Gly Val Leu Phe Thr Gly Gin Gly Ala Gin Trp 
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1585 1590 1595 1600 

Val Gly Met Gly Arg Gly Leu Tyr Ala Gly Gly Gly Val Phe Ala Glu 
1605 1610 1615 

Val Leu Asp Glu Val Leu Ser Met Val Gly Glu Val Asp Gly Arg Ser 
1620 1625 1630 

Leu Arg Asp Val Met Phe Gly Asp Val Asp Val Asp Ala Gly Ala Gly 
1635 1640 1645 

Ala Asp Ala Gly Ala Gly Ala Gly Ala Gly Val Gly Ser Gly Ser Gly 
1650 1655 1660 

Ser Val Gly Gly Leu Leu Gly Arg Thr Glu Phe Ala Gin Pro Ala Leu 
1665 1670 1675 1680 

Phe Ala Leu Glu Val Ala Leu Phe Arg Ala Leu Glu Ala Arg Gly Val 
1685 1690 1695 

Glu Val Ser Val Val Leu Gly His Ser Val Gly Glu Val Ala Ala Ala 
1700 1705 1710 

Tyr Val Ala Gly Val Leu Ser Leu Gly Asp Ala Val Arg Leu Val Val 
1715 1720 1725 

Ala Arg Gly Gly Leu Met Gly Gly Leu Pro Val Gly Gly Gly Met Trp 
1730 1735 1740 

Ser Val Gly Ala Ser Glu Ser Val Val Arg Gly Val Val Glu Gly Leu 
I 74 * 1750 1755 1760 

Gly Glu Trp Val Ser Val Ala Ala Val Asn Gly Pro Arg Ser Val Val 
1765 1770 ~ 1775 

Leu Ser Gly Asp Val Gly Val Leu Glu Ser Val Val Ala Ser Leu Met 
1780 1785 1790 

Gly Asp Gly Val Glu Cys Arg Arg Leu Asp Val Ser His Gly Phe His 
1795 1800 1805 

Ser Val Leu Met Glu Pro Val Leu Gly Glu Phe Arg Gly Val Val Glu 
1810 1815 1820 

Ser Leu Glu Phe Gly Arg Val Arg Pro Gly Val Val Val Val Ser Gly 
1825 1830 1835 1840 

Val Ser Gly Gly Val Val Gly Ser Gly Glu Leu Gly Asp Pro Gly Tyr 
1845 1850 1855 

Trp Val Arg His Ala Arg Glu Ala Val Arg Phe Ala Asp Gly Val Gly 
I860 1865 1870 

Val Val Arg Gly Leu Gly Val Gly Thr Leu Val Glu Val Gly Pro His 
1875 1880 1885 

Gly Val Leu Thr Gly Met Ala Gly Glu Cys Leu Gly Ala Gly Asp Asp 
1890 1895 1900 

Val Val Val Val Pro Ala Met Arg Arg Gly Arg Ala Glu Arg Glu Val 
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10 



1905 1910 1915 1920 

Phe Glu Ala Ala Leu Ala Thr Val Phe Thr Arg Asp Ala Gly Leu Asp 
1925 1930 * 1935 

Ala Thr Ala Leu His Thr Gly Ser Thr Gly Arg Arg lie Asp Leu Pro 
1940 1945 1950 

Thr Tyr Pro Phe Gin Arg Asp Arg Tyr Trp Leu Asp Pro Val Arg Thr 
1955 I960 1965 

Ala Val Thr Gly Val Glu Pro Ala Gly Ser Pro Ala Asp Ala Arg Ala 
1970 1975 1980 

Thr Glu Arg Gly Arg Ser Thr Thr Ala Gly lie Arg Tyr Arg Val Ala 
75 1985 1990 1995 2000 

Trp Gin Pro Ala Val Val Asp Arg Gly Asn Pro Gly Pro Ala Gly His 
2005 2010 * 2015 

20 VaX Leu Leu Leu Ala p r° Asp Glu Asp Thr Ala Asp Ser Gly Leu Ala 

2020 2025 2030 

Pro Ala He Ala Arg Glu Leu Ala Val Arg Gly Ala Glu Val His Thr 
2035 2040 2045 

25 Val Ala Val Pro Val Gly Thr Gly Arg Glu Ala Ala Gly Asp Leu Leu 

2050 2055 2060 

Arg Ala Ala Gly Asp Gly Ala Ala Arg Ser Thr Arg Val Leu Trp Leu 
2065 2070 2075 2080 

30 

Ala Pro Ala Glu Pro Asp Ala Ala Asp Ala Val Ala Leu Val Gin Ala 
2085 2090 2095 

Leu Gly Glu Ala Val Pro Glu Ala Pro Leu Trp He Thr Thr Arg Glu 
2100 2105 2110 

Ala Ala Ala Val Arg Pro Asp Glu Thr Pro Ser Val Gly Gly Ala Gin 
2115 2120 2125 

Leu Trp Gly Leu Gly Gin Val Ala Ala Leu Glu Leu Gly Arg Arg Trp 
2130 2135 2140 

Gly Gly Leu Ala Asp Leu Pro Gly Ser Ala Ser Pro Ala Val Leu Arg 
21 45 2150 2155 2160 

Thr Phe Val Gly Ala Leu Leu Ala Gly Gly Glu Asn Gin Phe Ala Val 
2165 2170 2175 

Arg Pro Ser Gly Val His Val Arg Arg Val Val Pro Ala Pro Val Pro 
2180 2185 2190 

so Val Pro Ala Ser Ala Arg Thr Val Thr Thr Ala Pro Ala Thr Ala Val 

2195 2200 2205 

Gly Glu Asp Ala Arg Asn Asp Thr Ser Asp Val Val Val Pro Asp Asp 
2210 2215 2220 



35 



40 



45 



55 



Arg Trp Ser Ser Gly Thr Val Leu He Thr Gly Gly Thr Gly Ala Leu 
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2225 2230 2235 2240 

Gly Ala Gin Val Ala Arg Arg Leu Ala Arg Ser Gly Ala Ala Arg Leu 
2245 2250 2255 

Leu Leu Val Gly Arg Arg Gly Ala Ala Gly Pro Gly Val Gly Glu Leu 
2260 2265 2270 

Val Glu Glu Leu Thr Ala Leu Gly Ser Glu Val Ala Val Glu Ala Cys 
2275 2280 2285 

Asp Val Ala Asp Arg Asp Ala Leu Ala Ala Leu Leu Ala Gly Leu Pro 
2290 2295 2300 

Glu Glu Arg Pro Leu Val Ala Val Leu His Ala Ala Gly Val Leu Asp 
2305 2310 2315 2320 

Asp Gly Val Leu Asp Ser Leu Thr Ser Asp Arg Val Asp Ala Val Leu 
2325 2330 2335 

Arg Asp Lys Val Thr Ala Ala Arg His Leu Asp Glu Leu Thr Ala Asp 
2340 2345 2350 

Leu Pro Leu Asp Ala Phe Val Leu Phe Ser Ser lie Val Gly Val Trp 
2355 2360 2365 

Gly Asn Gly Gly Gin Ala Val Tyr Ala Ala Ala Asn Ala Ala Leu Asp 
2370 2375 2380 

Ala Leu Ala Gin Arg Arg Arg Ala Arg Gly Ala Arg Ala Ala Ser lie 
2385 2390 2395 2400 

Ala Trp Gly Pro Trp Ala Gly Ala Gly Met Ala Ser Gly Thr Ala Ala 
2405 2410 2415 

Lys Ser Phe Glu Arg Asp Gly Val Thr Ala Leu Asp Pro Glu Arg Ala 
2420 2425 2430 

Leu Asp Val Leu Asp Asp Val Val Gly Ala Gly Gly Thr Ser Ala Ala 
2435 2440 2445 

Gly Thr His Ala Ala Gly Glu Ser Ser Leu Leu Val Ala Asp Val Asp 
2450 2455 2460 

Trp Glu Thr Phe Val Gly Arg Ser Val Thr Arg Arg Thr Trp Ser Leu 
2465 2470 2475 2480 

Phe Asp Gly Val Ser Ala Ala Arg Ser Ala Arg Ala Gly His Ala Ala 
2485 2490 2495 

Asp Asp Arg Ala Ala Leu Thr Pro Gly Thr Arg Pro Gly Asp Gly Ala 
2500 2505 2510 

Pro Gly Gly Ser Gly Gin Asp Gly Gly Glu Gly Arg Pro Trp Leu Ser 
2515 2520 2525 

Val Gly Pro Ser Pro Ala Glu Arg Arg Arg Ala Leu Leu Thr Leu Val 
2530 2535 2540 

Arg Ser Glu Ala Ala Gly He Leu Arg His Ala Ser Ala Asp Ala Val 
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2545 2550 2555 2560 

Asp Pro Glu Leu Ala Phe Arg Ser Ala Gly Phe Asp Ser Leu Thr Val 
2565 2570 2575 

Leu Glu Leu Arg Asn Arg Leu Thr Ala Ala Thr Gly Leu Asn Leu Pro 
2580 2585 2590 

Asn Thr Leu Leu Phe Asp His Pro Thr Pro Leu Ser Leu Ala Ser His 
2595 2600 2605 

Leu His Asp Glu Leu Phe Gly Pro Asp Ser Glu Ala Glu Pro Ala Ala 
2610 2615 2620 

Ala Ala Pro Thr Pro Val Met Ala Asp Glu Arg Glu Pro He Ala He 
2625 2630 2635 2640 

Val Gly Met Ala Cys Arg Tyr Pro Gly Gly Val Ala Ser Pro Asp Asp 
2645 2650 2655 

Leu Trp Asp Leu Val Ala Gly Asp Gly His Thr Leu Ser Pro Phe Pro 
2660 2665 2670 

Ala Asp Arg Gly Trp Asp Val Glu Gly Leu Tyr Asp Pro Glu Pro Gly 
2675 2680 ' 2685 

Val Pro Gly Lys Ser Tyr Val Arg Glu Gly Gly Phe Leu Arg Ser Ala 
2690 2695 2700 

Ala Glu Phe Asp Ala Glu Phe Phe Gly He Ser Pro Arg Glu Ala Thr 
2705 2710 2715 2720 

Ala Met Asp Pro Gin Gin Arg Leu Leu Leu Glu Thr Ser Trp Glu Ala 
2725 2730 2735 

Leu Glu Arg Ala Gly He Val Pro Asp Ser Leu Arg Gly Thr Arg Thr 
2740 2745 2750 

Gly Val Phe Ser Gly He Ser Gin Gin Asp Tyr Ala Thr Gin Leu Gly 
2755 2760 2765 

Asp Ala Ala Asp Thr Tyr Gly Gly His Val Leu Thr Gly Thr Leu Gly 
2770 2775 2780 

Ser Val He Ser Gly Arg Val Ala Tyr Ala Leu Gly Leu Glu Gly Pro 
2785 2790 2795 2800 

Ala Leu Thr Val Asp Thr Ala Cys Ser Ser Ser Leu Val Ala Leu His 
2805 2810 2815 

Leu Ala Val Gin Ser Leu Arg Arg Gly Glu Cys Asp Leu Ala Leu Ala 
2820 2825 * 2830 

Gly Gly Val Thr Val Met Ala Thr Pro Thr Val Phe Val Glu Phe Ser 
2835 2840 2845 

Arg Gin Arg Gly Leu Ala Ala Asp Gly Arg Cys Lys Ala Phe Ala Glu 
2850 2855 2860 

Gly Ala Asp Gly Thr Ala Trp Ala Glu Gly Val Gly Val Leu Leu Val 
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2865 2870 2875 2880 

Glu Arg Leu Ser Asp Ala Arg Arg Asn Gly His Arg Val Leu Ala Val 
2885 2890 2895 

Val Arg Gly Ser Ala Val Asn Gin Asp Gly Ala Ser Asn Gly Leu Thr 
2900 2905 2910 

Ala Pro Ser Gly Pro Ala Gin Gin Arg Val He Arg Glu Ala Leu Ala 
2915 2920 2925 

Asp Ala Gly Leu Val Pro Ala Asp Val Asp Val Val Glu Ala His Gly 
2930 2935 2940 

Thr Gly Thr Ala Leu Gly Asp Pro He Glu Ala Gly Ala Leu Leu Ala 
2945 2950 2955 2960 

Thr Tyr Gly Arg Glu Arg Val Gly Asp Pro Leu Trp Leu Gly Ser Leu 
2965 2970 2975 

Lys Ser Asn He Gly His Ala Gin Ala Ala Ala Gly Val Gly Gly Val 
2980 2985 2990 

He Lys Val Val Gin Gly Met Arg His Gly Ser Leu Pro Arg Thr Leu 
2995 3000 3005 

His Val Asp Ala Pro Ser Ser Lys Val Glu Trp Ala Ser Gly Ala Val 
3010 3015 3020 

Glu Leu Leu Thr Glu Thr Arg Ser Trp Pro Arg Arg Val Glu Arg Val 
3025 3030 3035 3040 

Arg Arg Ala Ala Val Ser Ala Phe Gly Val Ser Gly Thr Asn Ala His 
3045 3050 3055 

Val Val Leu Glu Glu Ala Pro Ala Glu Ala Gly Ser Glu His Gly Asp 
3060 3065 3070 

Gly Pro Glu Pro Glu Arg Pro Asp Ala Val Thr Gly Pro Leu Ser Trp 
3075 3080 3085 

Val Leu Ser Ala Arg Ser Glu Gly Ala Leu Arg Ala Gin Ala Val Arg 
3090 3095 3100 

Leu Arg Glu Cys Val Glu Arg Val Gly Ala Asp Pro Arg Asp Val Ala 
3105 3110 3115 3120 

Gly Ser Leu Val Val Ser Arg Ala Ser Phe Gly Glu Arg Ala Val Val 
3125 3130 3135 

Val Gly Arg Gly Arg Glu Glu Leu Leu Ala Gly Leu Asp Val Val Ala 
3140 3145 3150 

Ala Gly Ala Pro Val Gly Val Ser Ser Gly Ala Gly Ala Val Val Arg 
3155 3160 3165 

Gly Ser Ala Val Arg Gly Arg Gly Val Gly Val Leu Phe Thr Gly Gin 
3170 3175 3180 

Gly Ala Gin Trp Val Gly Met Gly Arg Gly Leu Tyr Ala Gly Gly Gly 
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3185 3190 3195 3200 

Val Phe Ala Glu Val Leu Asp Glu Val Leu Ser Val Val Gly Glu Val 
3205 3210 3215 

5 

Asp Gly Arg Ser Leu Arg Asp Val Met Phe Ala Asp Ala Asp Ser Val 
3220 3225 3230 

Leu Gly Gly Leu Leu Gly Arg Thr Glu Phe Ala Gin Pro Ala Leu Phe 
10 3235 3240 3245 

Ala Leu Glu Val Ala Leu Phe Arg Ala Leu Glu Ala Arg Gly Val Glu 
3250 3255 3260 

Val Ser Val Val Leu Gly His Ser Val Gly Glu Val Ala Ala Ala Tyr 
is 3265 3270 3275 3280 

Val Ala Gly Val Leu Ser Leu Gly Asp Ala Val Arg Leu Val Val Ala 
3285 3290 " 3295 

20 Arg Gly Gly Leu Met Gly Gly Leu Pro Val Gly Gly Gly Met Trp Ser 

3300 3305 3310 

Val Gly Ala Ser Glu Ser Val Val Arg Gly Val Val Glu Gly Leu Gly 
3315 3320 3325 

25 Glu Trp Val Ser Val Ala Ala Val Asn Gly Pro Arg Ser Val Val Leu 

3330 3335 3340 

Ser Gly Asp Val Gly Val Leu Glu Ser Val Val Val Thr Leu Met Gly 
3345 3350 3355 3360 

30 

Asp Gly Val Glu Cys Arg Arg Leu Asp Val Ser His Gly Phe His Ser 
3365 3370 3375 

Val Leu Met Glu Pro Val Leu Gly Glu Phe Arg Gly Val Val Glu Ser 
3380 3385 3390 

35 

Leu Glu Phe Gly Arg Val Arg Pro Gly Val Val Val Val Ser Gly Val 
3395 3400 3405 

Ser Gly Gly Val Val Gly Ser Gly Glu Leu Gly Asp Pro Gly Tyr Trp 
40 3410 3415 3420 

Val Arg His Ala Arg Glu Ala Val Arg Phe Ala Asp Gly Val Gly Val 
3425 3430 3435 3440 

Val Arg Gly Leu Gly Val Gly Thr Leu Val Glu Val Gly Pro His Gly 
45 3445 3450 * 3455 

Val Leu Thr Gly Met Ala Gly Gin Cys Leu Glu Ala Gly Asp Asp Val 
3460 3465 3470 

so Val Val Va * P*o Ala Met Arg Arg Gly Arg Pro Glu Arg Glu Val Phe 

3475 3480 3485 

Glu Ala Ala Leu Ala Thr Val Phe Thr Arg Asp Ala Gly Leu Asp Ala 
3490 3495 3500 

55 Thr Thr Leu His Thr Gly Ser Thr Gly Arg Arg lie Asp Leu Pro Thr 
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3505 3510 3515 3520 

Tyr Pro Phe Gin His Asn Arg Tyr Trp Ala Thr Gly Ser Val Thr Gly 
3525 3530 3535 

Ala Thr Gly Thr Ser Ala Ala Ala Arg Phe Gly Leu Glu Trp Lys Asp 
3540 3545 3550 

His Pro Phe Leu Ser Gly Ala Thr Pro He Ala Gly Ser Gly Ala Leu 
3555 3560 3565 

Leu Leu Thr Gly Arg Val Gly Leu Ala Ala His Pro Trp Leu Ala Asp 
3570 3575 3580 

His Ala He Ser Gly Thr Val Leu Leu Pro Gly Thr Ala He Ala Asp 
3585 3590 3595 3600 

Leu Leu Leu Arg Ala Val Glu Glu Val Gly Ala Gly Gly Val Glu Glu 
3605 3610 * 3615 

Leu Thr Leu His Glu Pro Leu Leu Leu Pro Glu Arg Gly Gly Leu His 
3620 3625 3630 

Val Gin Val Leu Val Glu Ala Ala Asp Glu Gin Gly Arg Arg Ala Val 
3635 3640 3645 

Ala Val Ala Ala Arg Pro Glu Gly Pro Gly Arg Asp Gly Glu Glu Gin 
3650 3655 3660 

Glu Trp Thr Arg His Ala Glu Gly Val Leu Thr Ser Thr Glu Thr Ala 
3665 3670 3675 3680 

Val Pro Asp Met Gly Trp Ala Ala Gly Ala Trp Pro Pro Pro Gly Ala 
3685 3690 3695 

Glu Pro He Asp Val Glu Glu Leu Tyr Asp Ala Phe Ala Ala Asp Gly 
3700 3705 3710 

Tyr Gly Tyr Gly Pro Ala Phe Thr Ala Leu Ser Gly Val Trp Arg Leu 
3715 3720 3725 

Gly Asp Glu Leu Phe Ala Glu Val Arg Arg Pro Ala Gly Gly Ala Gly 
3730 3735 3740 

Thr Thr Gly Asp Gly Phe Gly Val His Pro Ala Leu Phe Asp Ala Ala 
3745 3750 3755 3760 

Leu His Pro Trp Arg Ala Gly Gly Leu Leu Pro Asp Thr Gly Gly Thr 
3765 3770 3775 

Thr Trp Ala Pro Phe Ser Trp Gin Gly He Ala Leu His Thr Thr Gly 
3780 3785 3790 

Ala Glu Thr Leu Arg Val Arg Leu Ala Pro Ala Ala Gly Gly Thr Glu 
3795 3800 3805 

Ser Ala Phe Ser Val Gin Ala Ala Asp Pro Ala Gly Thr Pro Val Leu 
3810 3815 3820 

Thr Leu Asp Ala Leu Leu Leu Arg Pro Val Thr Leu Gly Arg Ala Asp 
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3825 3830 3835 3840 

Ala Pro Gin Pro Leu Tyr Arg Val Asp Trp Gin Pro Val Gly Gin Gly 
3845 3850 3855 

5 

Thr Glu Ala Ser Gly Ala Gin Gly Trp Thr Val Leu Gly Gin Ala Ala 
3860 3865 3870 

Ala Glu Thr Val Ala Gin Pro Ala Ala His Ala Asp Leu Thr Ala Leu 
10 3875 3880 3885 

Arg Thr Ala Val Ala Ala Ala Gly Thr Pro Val Pro Arg Leu Val Val 
3890 3895 3900 

Val Ser Pro Val Asp Thr Arg Leu Asp Glu Gly Pro Val Leu Ala Asp 
15 3905 3910 3915 3920 

Ala Glu Ala Arg Ala Arg Ala Gly Asp Gly Trp Asp Asp Asp Pro Leu 
3925 3930 * 3935 

20 Ar 9 Va1 Ala Leu G1 Y Arg Gly Leu Thr Leu Val Arg Glu Trp Val Glu 

3940 3945 3950 

Asp Glu Arg Leu Ala Asp Ser Arg Leu Val Val Leu Thr Arg Gly Ala 
3955 3960 3965 

25 Val Ala Ala Gly Pro Gly Asp Val Pro Asp Leu Thr Gly Ala Ala Leu 

3970 3975 3980 

Trp Gly Leu Leu Arg Ser Ala Gin Ser Glu Tyr Pro Asp Arg Phe Thr 
3985 3990 3995 4000 

Leu He Asp Val Asp Asp Ser Pro Glu Ser Arg Ala Ala Leu Pro Arg 
4005 4010 4015 

Ala Leu Gly Ser Ala Glu Arg Gin Leu Ala Leu Arg Thr Gly Asp Val 
4020 4025 4030 

Leu Ala Pro Ala Leu Val Pro Met Ala Thr Arg Pro Ala Glu Thr Thr 
4035 4040 4045 

Pro Ala Thr Ala Val Ala Ser Ala Thr Thr Gin Thr Gin Val Thr Ala 
40 4050 4055 4060 

Pro Ala Pro Asp Asp Pro Ala Ala Asp Ala Val Phe Asp Pro Ala Gly 
4065 4070 4075 4080 



30 
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45 



SO 



55 



Thr Val Leu He Thr Gly Gly Thr Gly Ala Leu Gly Arg Arg Val Ala 
4085 4090 4095 

Ser His Leu Ala Arg Arg Tyr Gly Val Arg His Met Leu Leu Val Ser 
4100 4105 4110 

Arg Arg Gly Pro Asp Ala Pro Glu Ala Gly Pro Leu Glu Arg Glu Leu 

4120 4125 

Ala Gly Leu Gly Val Thr Ala Thr Phe Leu Ala Cys Asp Leu Thr Asp 
4130 4135 4140 

He Glu Ala Val Arg Lys Ala Val Ala Ala Val Pro Ser Asp His Pro 
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4145 4150 4155 4160 

Leu Thr Gly Val Val His Thr Ala Gly Val Leu Asp Asp Gly Ala Leu 
5 4165 4170 4175 

Thr Gly Leu Thr Arg Gin Arg Leu Asp Thr Val Leu Arg Pro Lys Ala 
4180 4185 4190 

Asp Ala Val Arg Asn Leu His Glu Ala Thr Leu Asp Arg Pro Leu Arg 
10 4195 4200 4205 

Ala Phe Val Leu Phe Ser Ala Ala Ala Gly Leu Leu Gly Arg Pro Gly 
4210 4215 4220 

is Gin Ala Ser Tyr Ala Ala Ala Asn Ala Val Leu Asp Ala Leu Ala Gly 

4225 4230 4235 4240 

Ala Arg Arg Ala Ala Gly Leu Pro Ala Val Ser Leu Ala Trp Gly Leu 
4245 4250 4255 

20 

Trp Asp Glu Gin Thr Gly Met Ala Gly Gly Leu Asp Glu Met Ala Leu 
4260 4265 4270 

Arg Val Leu Arg Arg Asp Gly He Ala Ala Met Pro Pro Glu Gin Gly 
2s 4275 4280 4285 

Leu Glu Leu Leu Asp Leu Ala Leu Thr Gly His Arg Asp Gly Pro Ala 
4290 4295 4300 
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Val Leu Val Pro Leu Leu Leu Asp Gly Ala Ala Leu Arg Arg Thr Ala 
4305 4310 4315 4320 

Lys Glu Arg Gly Ala Ala Thr Met Ser Pro Leu Leu Arg Ala Leu Leu 
4325 4330 4335 

Pro Ala Ala Leu Arg Arg Ser Gly Gly Ala Gly Ala Pro Ala Ala Ala 
4340 4345 4350 

Asp Arg His Gly Lys Glu Ala Asp Pro Gly Ala Gly Arg Leu Ala Gly 
4355 4360 4365 

Met Val Ala Leu Glu Ala Ala Glu Arg Ser Ala Ala Val Leu Glu Leu 
4370 4375 4380 

Val Thr Glu Gin Val Ala Glu Val Leu Gly Tyr Ala Ser Ala Ala Glu 
4385 4390 4395 4400 

He Glu Pro Glu Arg Pro Phe Arg Glu He Gly Val Asp Ser Leu Ala 
4405 4410 4415 

Ala Val Glu Leu Arg Asn Arg Leu Ser Arg Leu Val Gly Leu Arg Leu 
4420 4425 4430 

Pro Thr Thr Leu Ser Phe Asp His Pro Thr Pro Lys Asp Met Ala Gin 
4435 4440 4445 

His He Asp Gly Gin Leu Pro Arg Pro Ala Gly Ala Ser Pro Ala Asp 
4450 4455 4460 

Ala Ala Leu Glu Gly He Gly Asp Leu Ala Arg Ala Val Ala Leu Leu 
4465 4470 4475 4480 

Gly Thr Gly Asp Ala Arg Arg Ala Glu Val Arg Glu Gin Leu Val Gly 
4485 4490 4495 

Leu Leu Ala Ala Leu Asp Pro Pro Gly Arg Thr Gly Thr Ala Ala Pro 
4500 4505 4510 

Gly Val Pro Ser Gly Ala Asp Gly Ala Glu Pro Thr Val Thr Asp Arg 
4515 4520 4525 

Leu Asp Glu Ala Thr Asp Asp Glu He Phe Ala Phe Leu Asp Glu Gin 
4530 4535 4540 

Leu * 
4545 
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(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
CGAGGCCGGC GGGCC 



Claims 

20 1 . A DNA molecule comprising an isolated DNA sequence that encodes a tylactone synthase domain. 

2. The DNA molecule as claimed in Claim 1 wherein the isolated DNA sequence is selected from the group consisting 
of: 

nucleotides 942 to 2156, 2571 to 3557, 3675 to 3929, 3993 to 5264, 5631 to 6617, 7410 to 7949, 8220 to 
25 8471, 8541 to 9812, 10260 to 11246, 11319 to 11876, 12861 to 13415, 13719 to 13970, 14411 to 15697, 16055 

to 17122, 17198 to 17794, 18584 to 19138, 19415 to 19666, 20136 to 21404, 21771 to 22757, 23541 to 24077, 
24360 to 24611, 24675 to 25949, 26292 to 27284, 27360 to 27917, 28767 to 29813, 29829 to 30368, 30651 to 
30902, 31337 to 32608, 32975 to 33961, 34694 to 35236, 35492 to 35743, 36360 to 37631, 37989 to 38987, 
39759 to 40313, 40575 to 40826, and 41235 to 41333 all in SEQ ID NO:1. 

30 

3. The DNA molecule as claimed in Claim 1 wherein the isolated DNA sequence is selected from the group consisting 

of: 

nucleotides 942 to 8471, 8541 to 13970, 14411 to 19666, 20136 to 24611 , 24675 to 30902, 31 337 to 35743, 
and 36360 to 40826 all in SEQ ID NO:1. 

35 

4. The DNA molecule as claimed in Claim 1 wherein the isolated DNA sequence is selected from the group consisting 
of: 

nucleotides 816 to 14234, 14351 to 19945, 20010 to 311 99, 31232 to 36067, and 36249 to 41 774 all in SEQ 
IDNO:1. 

40 

5. The DNA molecule as claimed in Claim 1 wherein the isolated DNA sequence is SEQ ID NO:1. 

6. A polypeptide comprising an amino acid sequence that consists of a tylactone synthase domain. 

45 7. The polypeptide of claim 6 wherein the amino acid sequence is selected from the group consisting of: 

(a) amino acids 43 to 447, 586 to 91 4, 954 to 1038, 1060 to 1 483, 1 606 to 1 934, 21 99 to 2378, 2469 to 2552, 
2576 to 2999, 3149 to 3477, 3502 to 3687, 4016 to 4200, and 4302 to 4385 in SEQ ID NO:2; 

(b) amino acids 21 to 449. 569 to 924, 950 to 1148, 1412 to 1596, and 1689 to 1772 in SEQ ID NO:3; 

50 ( C ) amino acids 43 to 465, 588 to 91 6, 1 1 78 to 1 356, and 1 451 to 1 534, 1 556 to 1 980, 2095 to 2425, 2451 to 

2636, 3274 to 3453, and 3548 to 3631 in SEQ ID NO:4; 

(d) amino acids 36 to 459, 582 to 910, 1155 to 1335, and 1421 to 1504 in SEQ ID NO:5; and 

(e) amino acids 38 to 461, 581 to 91 3, 1171 to 1355, 1443 to 1526, and 1663 to 1695 in SEQ ID NO:6. 

55 8. The polypeptide of claim 6 wherein the amino acid sequence is selected from the group consisting of: 

(a) amino acids 1060 to 2552 and 2576 to 4385 in SEQ ID NO:2; 

(b) amino acids 21 to 1772 in SEQ ID NO:3; 
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(c) amino acids 43 to 1534 and 1556 to 3631 in SEQ ID IMO:4; 

(d) amino acids 36 to 1504 in SEQ ID NO:5; and 

(e) amino acids 38 to 1526 in SEQ ID NO:6. 

s 9. The polypeptide of claim 6 wherein the amino acid sequence is selected from the group consisting of SEQ ID NO: 
2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, and SEQ ID NO:6. 

10. A recombinant DNA vector comprising a DNA molecule of Claim 1 . 

io 11. A recombinant DNA vector comprising a DNA molecule of Claim 2. 

12. A recombinant DNA vector comprising a DNA molecule of Claim 3. 

13. A recombinant DNA vector comprising a DNA molecule of Claim 4. 

is 

14. A recombinant DNA vector comprising a DNA molecule of Claim 5. 

15. A recombinant DNA vector of Claim 1 0 which is NRRL B-18688. 
20 16. A recombinant DNA vector of Claim 1 0 which is NRRL B-1 8689 

17. A host cell transformed with a recombinant DNA vector of Claim 10. 

18. A host cell transformed with a recombinant DNA vector of Claim 11 . 

25 

19. A host cell transformed with a recombinant DNA vector of Claim 12. 

20. A host cell transformed with a recombinant DNA vector of Claim 13. 
30 21. A host cell transformed with a recombinant DNA vector of Claim 14. 

22. The DNA molecule as claimed in Claim 1 wherein the isolated DNA sequence is SEQ ID NO:7. 

23. The polypeptide of SEQ ID NO:8. 
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Fig. 6 



N N 
I 

0 



N 
( 



N 
I 



I 

10 



N 
I 

I 

20 



Sf 
i 



N N 
j [_ 

I 

SO 



I 

40 



Wr Sf 
1 



45 



«rmG 



pKClOgO 



PKC1306 



N = Nrul 
Sf =S/ul 



220 



This Page is Inserted by IFW Indexing and Scanning 
Operations and is not part of the Official Record 



Defective images within this document are accurate representations of the original 
documents submitted by the applicant. 

Defects in the images include but are not limited to the items checked: 



[ST BLACK BORDERS 

□ IMAGE CUT OFF AT TOP, BOTTOM OR SIDES 

□ FADED TEXT OR DRAWING 

□ BLURRED OR ILLEGIBLE TEXT OR DRAWING 

□ SKEWED/SLANTED IMAGES 

□ COLOR OR BLACK AND WHITE PHOTOGRAPHS 

□ GRAY SCALE DOCUMENTS 

□ LINES OR MARKS ON ORIGINAL DOCUMENT 

□ REFERENCE(S) OR EXHD3IT(S) SUBMITTED ARE POOR QUALITY 

□ OTHER: 



IMAGES ARE BEST AVAILABLE COPY. 
As rescanning these documents will not correct the image 
problems checked, please do not report these problems to 
the IFW Image Problem Mailbox. 



BEST AVAILABLE IMAGES 




